31 Human Secreted Proteins

Field of the Invention 



This invention relates to newly identified polynucleotides and the polypeptides 
encoded by these polynucleotides, uses of such polynucleotides and polypeptides, and 
their production. 



Background of the Invention

Unlike bacteria, which exist as a single compartment surrounded by a
membrane, human cells and other eukaryotes are subdivided by membranes into many
functionally distinct compartments. Each membrane-bounded compartment, or
organelle, contains different proteins essential for the function of the organelle. The cell
uses "sorting signals," which are amino acid motifs located within the protein, to target
proteins to particular cellular organelles.

One type of sorting signal, called a signal sequence, a signal peptide, or a leader
sequence, directs a class of proteins to an organelle called the endoplasmic reticulum
(ER). The ER separates the membrane-bounded proteins from all other types of
proteins. Once localized to the ER, both groups of proteins can be further directed to
another organelle called the Golgi apparatus. Here, the Golgi distributes the proteins to
vesicles, including secretory vesicles, the cell membrane, lysosomes, and the other
organelles.

Proteins targeted to the ER by a signal sequence can be released into the
extracellular space as a secreted protein. For example, vesicles containing secreted
proteins can fuse with the cell membrane and release their contents into the extracellular
space, a process called exocytosis. Exocytosis can occur constitutively or after receipt
of a triggering signal. In the latter case, the proteins are stored in secretory vesicles (or
secretory granules) until exocytosis is triggered. Similarly, proteins residing on the cell
membrane can also be secreted into the extracellular space by proteolytic cleavage of a
"linker" holding the protein to the membrane.

Despite the great progress made in recent years, only a small number of genes
encoding human secreted proteins have been identified. These secreted proteins include
the commercially valuable human insulin, interferon, Factor VIII, human growth
hormone, tissue plasminogen activator, and erythropoietin. Thus, in light of the
pervasive role of secreted proteins in human physiology, a need exists for identifying
and characterizing novel human secreted proteins and the genes that encode them. This
knowledge will allow one to detect, to treat, and to prevent medical disorders by using
secreted proteins or the genes that encode them.

Summary of the Invention

The present invention relates to novel polynucleotides and the encoded
polypeptides. Moreover, the present invention relates to vectors, host cells, antibodies,
and recombinant methods for producing the polypeptides and polynucleotides. Also
provided are diagnostic methods for detecting disorders related to the polypeptides, and
therapeutic methods for treating such disorders. The invention further relates to
screening methods for identifying binding partners of the polypeptides.

Detailed Description

Definitions 

The following definitions are provided to facilitate understanding of certain 
terms used throughout this specification. 

In the present invention, "isolated" refers to material removed from its original
environment (e.g., the natural environment if it is naturally occurring), and thus is
altered "by the hand of man" from its natural state. For example, an isolated
polynucleotide could be part of a vector or a composition of matter, or could be
contained within a cell, and still be "isolated" because that vector, composition of
matter, or particular cell is not the original environment of the polynucleotide.

In the present invention, a "secreted" protein refers to those proteins capable of
being directed to the ER, secretory vesicles, or the extracellular space as a result of a
signal sequence, as well as those proteins released into the extracellular space without
necessarily containing a signal sequence. If the secreted protein is released into the
extracellular space, the secreted protein can undergo extracellular processing to produce
a "mature" protein. Release into the extracellular space can occur by many
mechanisms, including exocytosis and proteolytic cleavage.

In specific embodiments, the polynucleotides of the invention are less than 300
kb, 200 kb, 100 kb, 50 kb, 15 kb, 10 kb, or 7.5 kb in length. In a further
embodiment, polynucleotides of the invention comprise at least 15 contiguous
nucleotides of the coding sequence, but do not comprise all or a portion of any intron.
In another embodiment, the nucleic acid comprising the coding sequence does not
contain coding sequences of a genomic flanking gene (i.e., 5' or 3' to the gene in the
genome).

As used herein, a "polynucleotide" refers to a molecule having a nucleic acid
sequence contained in SEQ ID NO:X or the cDNA contained within the clone deposited
with the ATCC. For example, the polynucleotide can contain the nucleotide sequence
of the full length cDNA sequence, including the 5' and 3' untranslated sequences, the
coding region, with or without the signal sequence, the secreted protein coding region,
as well as fragments, epitopes, domains, and variants of the nucleic acid sequence.
Moreover, as used herein, a "polypeptide" refers to a molecule having the translated
amino acid sequence generated from the polynucleotide as broadly defined.

In the present invention, the full length sequence identified as SEQ ID NO:X
was often generated by overlapping sequences contained in multiple clones (contig
analysis). A representative clone containing all or most of the sequence for SEQ ID
NO:X was deposited with the American Type Culture Collection ("ATCC"). As
shown in Table 1, each clone is identified by a cDNA Clone ID (Identifier) and the
ATCC Deposit Number. The ATCC is located at 10801 University Boulevard,
Manassas, Virginia 20110-2209, USA. The ATCC deposit was made pursuant to the
terms of the Budapest Treaty on the international recognition of the deposit of
microorganisms for purposes of patent procedure.

A "polynucleotide" of the present invention also includes those polynucleotides
capable of hybridizing, under stringent hybridization conditions, to sequences contained
in SEQ ID NO:X, the complement thereof, or the cDNA within the clone deposited with
the ATCC. "Stringent hybridization conditions" refers to an overnight incubation at
42°C in a solution comprising 50% formamide, 5x SSC (750 mM NaCl, 75 mM sodium
citrate), 50 mM sodium phosphate (pH 7.6), 5x Denhardt's solution, 10% dextran
sulfate, and 20 µg/ml denatured, sheared salmon sperm DNA, followed by washing the
filters in 0.1x SSC at about 65°C.
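
As an illustrative aid only, the salt concentrations implied by the stringent conditions
above can be worked out from the stated composition of 5x SSC (750 mM NaCl, 75 mM
sodium citrate). The sketch below assumes that SSC scales linearly with its fold
designation; the function name ssc_composition is hypothetical and is not part of this
specification.

```python
# 1x SSC is inferred from the text: 5x SSC = 750 mM NaCl, 75 mM sodium citrate.
SSC_1X_MM = {"NaCl": 750 / 5, "sodium citrate": 75 / 5}


def ssc_composition(fold):
    """Return the millimolar composition of SSC at a given fold concentration.

    Assumes simple linear scaling with the fold designation.
    """
    if fold < 0:
        raise ValueError("fold concentration must be non-negative")
    return {solute: mm * fold for solute, mm in SSC_1X_MM.items()}


if __name__ == "__main__":
    for label, fold in (("hybridization (5x)", 5), ("wash (0.1x)", 0.1)):
        parts = ", ".join(
            f"{mm:.1f} mM {solute}"
            for solute, mm in ssc_composition(fold).items()
        )
        print(f"{label}: {parts}")
```

Under that assumption, the 0.1x SSC wash contains one fiftieth of the salt present in the
5x SSC hybridization solution, which is consistent with the wash being the more
stringent step.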

Also contemplated are nucleic acid molecules that hybridize to the
polynucleotides of the present invention at lower stringency hybridization conditions.
Changes in the stringency of hybridization and signal detection are primarily
accomplished through the manipulation of formamide concentration (lower percentages
of formamide result in lowered stringency), salt conditions, or temperature. For
example, lower stringency conditions include an overnight incubation at 37°C in a
solution comprising 6X SSPE (20X SSPE = 3M NaCl; 0.2M NaH2PO4; 0.02M EDTA,
pH 7.4), 0.5% SDS, 30% formamide, 100 µg/ml salmon sperm blocking DNA;
followed by washes at 50°C with 1X SSPE, 0.1% SDS. In addition, to achieve even
lower stringency, washes performed following stringent hybridization can be done at
higher salt concentrations (e.g., 5X SSC).

Note that variations in the above conditions may be accomplished through the
inclusion and/or substitution of alternate blocking reagents used to suppress
background in hybridization experiments. Typical blocking reagents include
Denhardt's reagent, BLOTTO, heparin, denatured salmon sperm DNA, and
commercially available proprietary formulations. The inclusion of specific blocking
reagents may require modification of the hybridization conditions described above, due
to problems with compatibility.

Of course, a polynucleotide which hybridizes only to polyA+ sequences (such
as any 3' terminal polyA+ tract of a cDNA shown in the sequence listing), or to a
complementary stretch of T (or U) residues, would not be included in the definition of
"polynucleotide," since such a polynucleotide would hybridize to any nucleic acid
molecule containing a poly (A) stretch or the complement thereof (e.g., practically any
double-stranded cDNA clone).
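
By way of illustration only, the exclusion above can be approximated in software by
flagging candidate probe sequences that consist solely of A residues or solely of T (or
U) residues. The sketch below is a simplified sequence-level proxy and does not model
hybridization itself; the function name is_homopolymer_a_or_t is hypothetical.

```python
def is_homopolymer_a_or_t(sequence):
    """Return True if the sequence is made up of only A, only T, or only U.

    Such a probe could pair only with a poly(A) tract or its complement,
    so it would fall outside the definition given above. Empty input is
    treated as not matching.
    """
    residues = sequence.upper()
    return len(set(residues)) == 1 and residues[0] in "ATU"


if __name__ == "__main__":
    for probe in ("AAAAAAAAAAAAAAA", "TTTTTTTTTTTTTTT", "AAAAGAAAAAAAAAA"):
        status = "excluded" if is_homopolymer_a_or_t(probe) else "retained"
        print(f"{probe}: {status}")
```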

The polynucleotide of the present invention can be composed of any
polyribonucleotide or polydeoxyribonucleotide, which may be unmodified RNA or DNA
or modified RNA or DNA. For example, polynucleotides can be composed of single-
and double-stranded DNA, DNA that is a mixture of single- and double-stranded
regions, single- and double-stranded RNA, and RNA that is a mixture of single- and
double-stranded regions, hybrid molecules comprising DNA and RNA that may be
single-stranded or, more typically, double-stranded or a mixture of single- and double-
stranded regions. In addition, the polynucleotide can be composed of triple-stranded
regions comprising RNA or DNA or both RNA and DNA. A polynucleotide may also
contain one or more modified bases or DNA or RNA backbones modified for stability
or for other reasons. "Modified" bases include, for example, tritylated bases and
unusual bases such as inosine. A variety of modifications can be made to DNA and
RNA; thus, "polynucleotide" embraces chemically, enzymatically, or metabolically
modified forms.

The polypeptide of the present invention can be composed of amino acids joined
to each other by peptide bonds or modified peptide bonds, i.e., peptide isosteres, and
may contain amino acids other than the 20 gene-encoded amino acids. The
polypeptides may be modified by either natural processes, such as posttranslational
processing, or by chemical modification techniques which are well known in the art.
Such modifications are well described in basic texts and in more detailed monographs,
as well as in a voluminous research literature. Modifications can occur anywhere in a
polypeptide, including the peptide backbone, the amino acid side-chains and the amino
or carboxyl termini. It will be appreciated that the same type of modification may be
present in the same or varying degrees at several sites in a given polypeptide. Also, a
given polypeptide may contain many types of modifications. Polypeptides may be
branched, for example, as a result of ubiquitination, and they may be cyclic, with or
without branching. Cyclic, branched, and branched cyclic polypeptides may result
from posttranslational natural processes or may be made by synthetic methods.
Modifications include acetylation, acylation, ADP-ribosylation, amidation, covalent
attachment of flavin, covalent attachment of a heme moiety, covalent attachment of a
nucleotide or nucleotide derivative, covalent attachment of a lipid or lipid derivative,
covalent attachment of phosphatidylinositol, cross-linking, cyclization, disulfide bond
formation, demethylation, formation of covalent cross-links, formation of cysteine,
formation of pyroglutamate, formylation, gamma-carboxylation, glycosylation, GPI
anchor formation, hydroxylation, iodination, methylation, myristoylation, oxidation,
pegylation, proteolytic processing, phosphorylation, prenylation, racemization,
selenoylation, sulfation, transfer-RNA mediated addition of amino acids to proteins
such as arginylation, and ubiquitination. (See, for instance, PROTEINS -
STRUCTURE AND MOLECULAR PROPERTIES, 2nd Ed., T. E. Creighton, W.
H. Freeman and Company, New York (1993); POSTTRANSLATIONAL
COVALENT MODIFICATION OF PROTEINS, B. C. Johnson, Ed., Academic
Press, New York, pgs. 1-12 (1983); Seifter et al., Meth Enzymol 182:626-646 (1990);
Rattan et al., Ann NY Acad Sci 663:48-62 (1992).)

"SEQ ID NO:X" refers to a polynucleotide sequence while "SEQ ID NO: Y" 
refers to a polypeptide sequence, both sequences identified by an integer specified in 

Table 1.

"A polypeptide having biological activity" refers to polypeptides exhibiting 
activity similar, but not necessarily identical to, an activity of a polypeptide of the
present invention, including mature forms, as measured in a particular biological assay,
with or without dose dependency. In the case where dose dependency does exist, it
need not be identical to that of the polypeptide, but rather substantially similar to the
dose-dependence in a given activity as compared to the polypeptide of the present
invention (i.e., the candidate polypeptide will exhibit greater activity or not more than
about 25-fold less and, preferably, not more than about tenfold less activity, and most
preferably, not more than about three-fold less activity relative to the polypeptide of the
present invention).
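
For illustration only, the fold limits in the definition above can be expressed as a
simple classification. The sketch below assumes that both activities are measured in the
same assay and units, and it treats the "about" limits (25-fold, tenfold, three-fold) as
hard cutoffs, which is a simplification. The function name activity_tier and the tier
labels are hypothetical.

```python
def activity_tier(candidate_activity, reference_activity):
    """Classify a candidate by how many fold less active it is than the reference.

    Greater-than-reference activity yields a fold value below 1 and
    therefore falls in the most preferred tier.
    """
    if candidate_activity <= 0 or reference_activity <= 0:
        raise ValueError("activities must be positive")
    fold_less = reference_activity / candidate_activity
    if fold_less <= 3:
        return "most preferred (not more than about three-fold less)"
    if fold_less <= 10:
        return "preferred (not more than about tenfold less)"
    if fold_less <= 25:
        return "within definition (not more than about 25-fold less)"
    return "outside definition"


if __name__ == "__main__":
    reference = 100.0  # arbitrary assay units, illustrative only
    for candidate in (120.0, 50.0, 20.0, 5.0, 2.0):
        print(candidate, "->", activity_tier(candidate, reference))
```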

Polynucleotides and Polypeptides of the Invention 

FEATURES OF PROTEIN ENCODED BY GENE NO: 1 

In specific embodiments, polypeptides of the invention comprise the following
amino acid sequence: NYFPVHTVQPNWYV (SEQ ID NO:77). Polynucleotides
encoding these polypeptides are also encompassed by the invention. The gene encoding
the disclosed cDNA is thought to reside on chromosome 1. Accordingly,
polynucleotides related to this invention are useful as a marker in linkage analysis for
chromosome 1.

This gene is expressed primarily in whole brain and infant brain tissues, and to
a lesser extent in T-cells and fetal lung tissue.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions which include, but are
not limited to, neurodegenerative disorders. Similarly, polypeptides and antibodies
directed to these polypeptides are useful in providing immunological probes for
differential identification of the tissue(s) or cell type(s). For a number of disorders of
the above tissues or cells, particularly of the central nervous system, expression of this
gene at significantly higher or lower levels may be routinely detected in certain tissues
or cell types (e.g., neural, cancerous and wounded tissues) or bodily fluids (e.g.,
lymph, serum, plasma, urine, synovial fluid and spinal fluid) or another tissue or cell
sample taken from an individual having such a disorder, relative to the standard gene
expression level, i.e., the expression level in healthy tissue or bodily fluid from an
individual not having the disorder.

The tissue distribution in neural tissues such as infant and whole brain tissues
indicates that polynucleotides and polypeptides corresponding to this gene are useful for
the diagnosis and/or treatment of neurodegenerative disorders. Furthermore, the tissue
distribution in brain tissues indicates that polynucleotides and polypeptides
corresponding to this gene are useful for the detection/treatment of neurodegenerative
disease states and behavioural disorders such as Alzheimer's Disease, Parkinson's
Disease, Huntington's Disease, Tourette Syndrome, schizophrenia, mania, dementia,
paranoia, obsessive compulsive disorder, panic disorder, learning disabilities, ALS,
psychoses, autism, and altered behaviors, including disorders in feeding, sleep
patterns, balance, and perception.

In addition, the gene or gene product may also play a role in the treatment and/or
detection of developmental disorders associated with the developing embryo, or
sexually-linked disorders. The protein, as well as antibodies directed against the protein,
may show utility as a tumor marker and/or immunotherapy target for the above listed
tissues. The first approximately 333 nt of sequence shown in the sequence listing is
vector sequence which will immediately be recognized by those of skill in the art.

Many polynucleotide sequences, such as EST sequences, are publicly available
and accessible through sequence databases. Some of these sequences are related to SEQ
ID NO:11 and may have been publicly available prior to conception of the present
invention. Preferably, such related polynucleotides are specifically excluded from the
scope of the present invention. To list every related sequence is cumbersome.
Accordingly, preferably excluded from the present invention are one or more
polynucleotides comprising a nucleotide sequence described by the general formula of
a-b, where a is any integer between 1 to 776 of SEQ ID NO:11, b is an integer of 15 to
790, where both a and b correspond to the positions of nucleotide residues shown in
SEQ ID NO:11, and where b is greater than or equal to a + 14.
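
The general formula a-b above can be illustrated with a short sketch. It assumes that
the stated integer ranges are inclusive at both ends, and it uses the bounds given for
SEQ ID NO:11 (a up to 776, b up to 790) as default arguments; the function names are
hypothetical. The condition b >= a + 14 means that every described fragment spans at
least 15 nucleotide positions.

```python
def is_described_fragment(a, b, a_max=776, b_max=790):
    """Return True if positions a..b satisfy the general formula a-b.

    Ranges are assumed inclusive: 1 <= a <= a_max, 15 <= b <= b_max,
    and b >= a + 14.
    """
    return 1 <= a <= a_max and 15 <= b <= b_max and b >= a + 14


def count_described_fragments(a_max=776, b_max=790):
    """Count every (a, b) pair that satisfies the formula by brute force."""
    return sum(
        1
        for a in range(1, a_max + 1)
        for b in range(15, b_max + 1)
        if b >= a + 14
    )


if __name__ == "__main__":
    print(is_described_fragment(1, 15))     # shortest fragment at the 5' end
    print(is_described_fragment(776, 790))  # shortest fragment at the 3' end
    print(is_described_fragment(10, 20))    # spans only 11 positions
    print(count_described_fragments())
```

Under the inclusive-range assumption, the count equals 776 x 777 / 2 = 301,476
fragments, which suggests why listing every related sequence individually would be
cumbersome.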

FEATURES OF PROTEIN ENCODED BY GENE NO: 2 

This gene is expressed primarily in colon tissue. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, gastrointestinal disorders and colon cancer. Similarly, polypeptides and 
antibodies directed to these polypeptides are useful in providing immunological probes 
for differential identification of the tissue(s) or cell type(s). For a number of disorders 
of the above tissues or cells, particularly of the gastrointestinal system, expression of 
this gene at significantly higher or lower levels may be routinely detected in certain
tissues or cell types (e.g., gastrointestinal, cancerous and wounded tissues) or bodily 
fluids (e.g., lymph, serum, plasma, urine, synovial fluid and spinal fluid) or another 
tissue or cell sample taken from an individual having such a disorder, relative to the 
standard gene expression level, i.e., the expression level in healthy tissue or bodily 
fluid from an individual not having the disorder. 

Preferred epitopes include those comprising a sequence shown in SEQ ID NO: 
45 as residues: Ser-69 to Lys-74. 
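
As a hedged illustration of the residue-range notation used above (for example,
"Ser-69 to Lys-74"), the sketch below parses such a range and extracts the
corresponding peptide from a one-letter amino acid sequence, checking that the first
and last residues match the named amino acids. SEQ ID NO:45 is not reproduced in this
section, so the demonstration uses an invented placeholder sequence; the function names
and the placeholder are hypothetical.

```python
import re

# Standard three-letter to one-letter amino acid codes.
THREE_TO_ONE = {
    "Ala": "A", "Arg": "R", "Asn": "N", "Asp": "D", "Cys": "C",
    "Gln": "Q", "Glu": "E", "Gly": "G", "His": "H", "Ile": "I",
    "Leu": "L", "Lys": "K", "Met": "M", "Phe": "F", "Pro": "P",
    "Ser": "S", "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
}

RANGE_PATTERN = re.compile(r"\s*([A-Za-z]{3})-(\d+)\s+to\s+([A-Za-z]{3})-(\d+)\s*")


def parse_residue_range(text):
    """Parse 'Ser-69 to Lys-74' into ('S', 69, 'K', 74)."""
    match = RANGE_PATTERN.fullmatch(text)
    if match is None:
        raise ValueError(f"unrecognized residue range: {text!r}")
    first, start, last, end = match.groups()
    return (THREE_TO_ONE[first.capitalize()], int(start),
            THREE_TO_ONE[last.capitalize()], int(end))


def extract_epitope(sequence, text):
    """Return the peptide for a 1-based inclusive residue range."""
    first, start, last, end = parse_residue_range(text)
    if start < 1 or end < start:
        raise ValueError("residue positions must be 1-based and increasing")
    peptide = sequence[start - 1:end]
    if len(peptide) != end - start + 1:
        raise ValueError("range extends beyond the sequence")
    if peptide[0] != first or peptide[-1] != last:
        raise ValueError("named residues do not match the sequence")
    return peptide


if __name__ == "__main__":
    # Placeholder sequence (not SEQ ID NO:45): Ser at position 69, Lys at 74.
    placeholder = "A" * 68 + "SGTPEK" + "A" * 10
    print(extract_epitope(placeholder, "Ser-69 to Lys-74"))
```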

The tissue distribution in colon tissue indicates that polynucleotides and 
polypeptides corresponding to this gene are useful for the diagnosis and treatment of 
colon cancer. Furthermore, the tissue distribution in gastrointestinal tissues (colon) 
indicates that polynucleotides and polypeptides corresponding to this gene are useful for 
the diagnosis, prevention, and/or treatment of various metabolic disorders such as
Tay-Sachs disease, phenylketonuria, galactosemia, porphyrias, and Hurler's syndrome.
The protein, as well as antibodies directed against the protein, may show utility as a
tumor marker and/or immunotherapy target for the above listed tissues.

Many polynucleotide sequences, such as EST sequences, are publicly available 
and accessible through sequence databases. Some of these sequences are related to SEQ 
ID NO: 12 and may have been publicly available prior to conception of the present 
invention. Preferably, such related polynucleotides are specifically excluded from the 




wo 99/46289 



PCT/US99/05721 



scope of the present invention. To list every related sequence is cumbersome.
Accordingly, preferably excluded from the present invention are one or more
polynucleotides comprising a nucleotide sequence described by the general formula of
a-b, where a is any integer between 1 to 540 of SEQ ID NO:12, b is an integer of 15 to
554, where both a and b correspond to the positions of nucleotide residues shown in
SEQ ID NO:12, and where b is greater than or equal to a + 14.

FEATURES OF PROTEIN ENCODED BY GENE NO: 3 

In specific embodiments, polypeptides of the invention comprise the following
amino acid sequence: PVFTVNFLAWVHAPPVSITVDLIPTLAQAWS (SEQ ID
NO:78). Polynucleotides encoding these polypeptides are also encompassed by the
invention.

This gene is expressed primarily in colon tissue.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions which include, but are
not limited to, gastrointestinal disorders and colon cancer. Similarly, polypeptides and
antibodies directed to these polypeptides are useful in providing immunological probes
for differential identification of the tissue(s) or cell type(s). For a number of disorders
of the above tissues or cells, particularly of the gastrointestinal system, expression of
this gene at significantly higher or lower levels may be routinely detected in certain
tissues or cell types (e.g., gastrointestinal, cancerous and wounded tissues) or bodily
fluids (e.g., lymph, serum, plasma, urine, synovial fluid and spinal fluid) taken from
an individual having such a disorder, relative to the standard gene expression level, i.e.,
the expression level in healthy tissue or bodily fluid from an individual not having the
disorder.

The tissue distribution in colon tissue indicates that polynucleotides and
polypeptides corresponding to this gene are useful for the diagnosis and treatment of
colon cancer. Furthermore, the tissue distribution in gastrointestinal tissues (colon)
indicates that polynucleotides and polypeptides corresponding to this gene are useful for
the diagnosis, prevention, and/or treatment of various metabolic disorders such as
Tay-Sachs disease, phenylketonuria, galactosemia, porphyrias, and Hurler's syndrome.
The protein, as well as antibodies directed against the protein, may show utility as a
tumor marker and/or immunotherapy target for the above listed tissues.

Many polynucleotide sequences, such as EST sequences, are publicly available
and accessible through sequence databases. Some of these sequences are related to SEQ
ID NO:13 and may have been publicly available prior to conception of the present
invention. Preferably, such related polynucleotides are specifically excluded from the
scope of the present invention. To list every related sequence is cumbersome.
Accordingly, preferably excluded from the present invention are one or more
polynucleotides comprising a nucleotide sequence described by the general formula of
a-b, where a is any integer between 1 to 1092 of SEQ ID NO:13, b is an integer of 15
to 1106, where both a and b correspond to the positions of nucleotide residues shown
in SEQ ID NO:13, and where b is greater than or equal to a + 14.

FEATURES OF PROTEIN ENCODED BY GENE NO: 4 
In specific embodiments, polypeptides of the invention comprise the following
amino acid sequence: WIQRIRTSADQLGPKKVVXFGLACCGVSGLFYA (SEQ ID
NO:79). Polynucleotides encoding these polypeptides are also encompassed by the
invention.

This gene is expressed primarily in CD34 positive cells. 

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions which include, but are
not limited to, inflammation, allergy and graft rejection, and immune system disorders.
Similarly, polypeptides and antibodies directed to these polypeptides are useful in
providing immunological probes for differential identification of the tissue(s) or cell
type(s). For a number of disorders of the above tissues or cells, particularly of the
hematopoietic and immune systems, expression of this gene at significantly higher or
lower levels may be routinely detected in certain tissues or cell types (e.g., immune,
cancerous and wounded tissues) or bodily fluids (e.g., lymph, serum, plasma, urine,
synovial fluid and spinal fluid) or another tissue or cell sample taken from an individual
having such a disorder, relative to the standard gene expression level, i.e., the
expression level in healthy tissue or bodily fluid from an individual not having the
disorder.

The tissue distribution in CD34 positive cells indicates that polynucleotides and
polypeptides corresponding to this gene are useful for the diagnosis and/or treatment of
hematopoietic and immune disorders such as inflammation, as well as immune
modulation and differentiation. Furthermore, expression of this gene product in CD34
positive cells indicates a role in the regulation of the proliferation; survival;
differentiation; and/or activation of potentially all hematopoietic cell lineages, including
blood stem cells. This gene product may be involved in the regulation of cytokine
production, antigen presentation, or other processes that may also suggest a usefulness
in the treatment of cancer (e.g., by boosting immune responses).
Since the gene is expressed in cells of lymphoid origin, the gene or protein, as
well as antibodies directed against the protein, may show utility as a tumor marker
and/or immunotherapy target for the above listed tissues. Therefore, it may also be used
as an agent for immunological disorders including arthritis, asthma, immune deficiency
diseases such as AIDS, leukemia, rheumatoid arthritis, inflammatory bowel disease,
sepsis, acne, and psoriasis. In addition, this gene product may have commercial utility
in the expansion of stem cells and committed progenitors of various blood lineages, and
in the differentiation and/or proliferation of various cell types. The protein, as well as
antibodies directed against the protein, may show utility as a tumor marker and/or
immunotherapy target for the above listed tissues. Those of skill in the art will
recognize that some vector nucleotide sequence is contained at the 5' and 3' ends of the
sequence shown for this gene in the sequence listing.

Many polynucleotide sequences, such as EST sequences, are publicly available
and accessible through sequence databases. Some of these sequences are related to SEQ
ID NO:14 and may have been publicly available prior to conception of the present
invention. Preferably, such related polynucleotides are specifically excluded from the
scope of the present invention. To list every related sequence is cumbersome.
Accordingly, preferably excluded from the present invention are one or more
polynucleotides comprising a nucleotide sequence described by the general formula of
a-b, where a is any integer between 1 to 554 of SEQ ID NO:14, b is an integer of 15 to
568, where both a and b correspond to the positions of nucleotide residues shown in
SEQ ID NO:14, and where b is greater than or equal to a + 14.



FEATURES OF PROTEIN ENCODED BY GENE NO: 5 
In specific embodiments, polypeptides of the invention comprise the following
amino acid sequences: PPGLCAAIPLQTRSAQGPWGGRQGSGWCWGTWGSGSS
GGGNAFTGLGPVSTLPSLHGKQGVTSVTCHGGYVYTTGRXGAYYQLFVRDG
QLQP\a.RQKSCRGMNWLAGLRIVPDGSMVILGFHANEFVVWNPRSHEKLHIV
NCGGGHRSWAFSDTEAAMAFAYLKDGDVMLYRALGGCTRPHVILREGLHGR
EITCVKRVGTTTLGPEYGVPSFMQPDDLEPGSEGPDLTDIVITCSEDTTVCVLALP
TTTGSAHALTAVCNHISSVRAVAVWGIGTPGGPQDPQPGLTAHVVSAGGRAE
MHCFSIM\nrPDPSTPSRLACHVMIlLXSHRLDEYWDRQRN^
(SEQ ID NO:80), PPGLCAAIPLQTRSAQGPWGGRQGSGWCWGTVVGSGSS
(SEQ ID NO:81), GGGNAFTGLGPVSTLPSLHGKQGVTSVTCHGGYVYTTGRX
(SEQ ID NO:82), GAYYQLFVRIXjQLQP\a.RQKSCRGMNWLAGLRrVTDGSMV
(SEQ ID NO:83), ILGFHANEFWWNPRSHEKLHIVNCGGGHRSWAFSDTEAAM
(SEQ ID NO:84), AFAYLKIXJDVMLYRAIXjGCIRPHVILREGLHGRETC
G (SEQ ID NO:85), TITLGPEYGVPSFMQPDDLEPGSEGPDLTDIVITCSEDTTVCV
(SEQ ID NO:86), LALPTTTGSAHALTAVCNHISSVRAVAVWGIGTPGGPQDPQ
(SEQ ID NO:87), PGLTAHVVSAGGRAEMHCFSIMVTPDPSTPSRLACHVMHL
(SEQ ID NO:88), and/or XSHRLDEYWDRQRNRHRMVKVDPETR (SEQ ID
NO:89). Polynucleotides encoding these polypeptides are also encompassed by the
invention.

This gene is expressed primarily in LNCAP untreated cell line and endometrial 
tumor tissue, and to a lesser extent in other cancerous tissues such as adrenal gland 
tumor tissues and synovial sarcoma tissues. 

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions which include, but are
not limited to, cancers, i.e., uncontrolled cell proliferation and/or differentiation.
Similarly, polypeptides and antibodies directed to these polypeptides are useful in
providing immunological probes for differential identification of the tissue(s) or cell
type(s). For a number of disorders of the above tissues or cells, particularly of the
prostate and endometrial tissues, expression of this gene at significantly higher or lower
levels may be detected in certain tissues or cell types (e.g., reproductive,
gastrointestinal, cancerous and wounded tissues) or bodily fluids (e.g., lymph, serum,
plasma, urine, synovial fluid and spinal fluid) or another tissue or cell sample taken
from an individual having such a disorder, relative to the standard gene expression
level, i.e., the expression level in healthy tissue from an individual not having the
disorder.

Preferred epitopes include those comprising a sequence shown in SEQ ID NO:
48 as residues: Lys-37 to Ile-45.

The tissue distribution in cancerous tissues, such as cancerous tissues of the
endometrium, synovium, and adrenal gland tissues, indicates that polynucleotides and
polypeptides corresponding to this gene are useful for the diagnosis and/or treatment of
tumors, as well as for regulating cell proliferation and/or differentiation. Expression
within cellular sources marked by proliferating cells indicates that this protein may play
a role in the regulation of cellular division, and may show utility in the diagnosis and
treatment of cancer and other proliferative disorders. Thus, this protein may also be
involved in apoptosis or tissue differentiation and could again be useful in cancer
therapy. The protein, as well as antibodies directed against the protein, may show utility
as a tumor marker and/or immunotherapy target for the above listed tissues.

Many polynucleotide sequences, such as EST sequences, are publicly available
and accessible through sequence databases. Some of these sequences are related to SEQ
ID NO:15 and may have been publicly available prior to conception of the present
invention. Preferably, such related polynucleotides are specifically excluded from the
scope of the present invention. To list every related sequence is cumbersome.
Accordingly, preferably excluded from the present invention are one or more
polynucleotides comprising a nucleotide sequence described by the general formula of
a-b, where a is any integer between 1 to 3678 of SEQ ID NO:15, b is an integer of 15
to 3692, where both a and b correspond to the positions of nucleotide residues shown
in SEQ ID NO:15, and where b is greater than or equal to a + 14.

FEATURES OF PROTEIN ENCODED BY GENE NO: 6

In specific embodiments, polypeptides of the invention comprise the following
amino acid sequences: LMSLLTSPHQPPPPPPASASPSAVPNGPQSPKQQKEPL
SHRFNEFMTSKPKIHCFRSLKRGVSSAPESCLSGVLWLHVWFCITNFVCE
(SEQ ID NO:90); FQNAKEEASVLPYVETVFLFGGGIFAMALCLISDALSSYR
DSHTNRVLTSPPF (SEQ ID NO:91); and/or RLMPFPPSSPRLLVTLAGREDVV
GHSCNTLSAHLLEIVTMLITWF (SEQ ID NO:92). Polynucleotides encoding these
polypeptides are also encompassed by the invention. The gene encoding the disclosed
cDNA is thought to reside on chromosome 9. Accordingly, polynucleotides related to
this invention are useful as a marker in linkage analysis for chromosome 9.

This gene is expressed primarily in activated T-cells.

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, immune disorders. Similarly, polypeptides and antibodies directed to
these polypeptides are useful in providing immunological probes for differential
identification of the tissue(s) or cell type(s). For a number of disorders of the above
tissues or cells, particularly of the immune system, expression of this gene at
significantly higher or lower levels may be routinely detected in certain tissues or cell
types (e.g., immune, cancerous and wounded tissues) or bodily fluids (e.g., lymph,
serum, plasma, urine, synovial fluid and spinal fluid) or another tissue or cell sample
taken from an individual having such a disorder, relative to the standard gene
expression level, i.e., the expression level in healthy tissue or bodily fluid from an
individual not having the disorder.

The tissue distribution primarily in T-cells indicates that polynucleotides and
polypeptides corresponding to this gene are useful for the diagnosis and/or treatment of
immune disorders involving activated T-cells, e.g., in diseases relating to improper
thymus, liver, and/or spleen function. Furthermore, expression of this gene product in
T-cells indicates a role in the regulation of the proliferation, survival, differentiation,
and/or activation of potentially all hematopoietic cell lineages, including blood stem
cells. This gene product may be involved in the regulation of cytokine production,
antigen presentation, or other processes that may also suggest a usefulness in the
treatment of cancer (e.g., by boosting immune responses).

Since the gene is expressed in cells of lymphoid origin, the gene or protein, as
well as antibodies directed against the protein, may show utility as a tumor marker
and/or immunotherapy target for the above-listed tissues. Therefore, it may also be used
as an agent for immunological disorders including arthritis, asthma, immune deficiency
diseases such as AIDS, leukemia, rheumatoid arthritis, inflammatory bowel disease,
sepsis, acne, and psoriasis. In addition, this gene product may have commercial utility
in the expansion of stem cells and committed progenitors of various blood lineages, and
in the differentiation and/or proliferation of various cell types. Expression of this gene
product in T cells also strongly indicates a role for this protein in immune function and
immune surveillance. The protein, as well as antibodies directed against the protein, may
show utility as a tumor marker and/or immunotherapy target for the above-listed
tissues.

Many polynucleotide sequences, such as EST sequences, are publicly available 
and accessible through sequence databases. Some of these sequences are related to SEQ
ID NO: 16 and may have been publicly available prior to conception of the present
invention. Preferably, such related polynucleotides are specifically excluded from the
scope of the present invention. To list every related sequence is cumbersome.
Accordingly, preferably excluded from the present invention are one or more
polynucleotides comprising a nucleotide sequence described by the general formula of
a-b, where a is any integer between 1 to 1414 of SEQ ID NO: 16, b is an integer of 15
to 1428, where both a and b correspond to the positions of nucleotide residues shown
in SEQ ID NO: 16, and where b is greater than or equal to a + 14.

FEATURES OF PROTEIN ENCODED BY GENE NO: 7 

The translation product of this gene shares sequence homology with a rat
potassium-dependent sodium-calcium exchanger (See Genbank Accession No.
gi|2662461), as well as one from Bos taurus. These proteins are thought to be
important in modulating Ca2+ flux across the rod outer segments (ROS) of the retinal
rod photoreceptors.

In specific embodiments, polypeptides of the invention comprise the following
amino acid sequence: GGXDDDEGPYTPFDTPSGKLETVKWAFTWPLSFVLYF
TVPNCNKPRWEKWF (SEQ ID NO:93). Polynucleotides encoding these
polypeptides are also encompassed by the invention.

When tested against Jurkat cell lines, supernatants removed from cells
containing this gene activated the NF-kB transcription factor. Thus, it is likely that this
gene activates Jurkat cells, and to a lesser extent other immune cells, by activating a
transcriptional factor found within these cells. Nuclear factor kB is a transcription factor
activated by a wide variety of agents, leading to cell activation, differentiation, or
apoptosis. Reporter constructs utilizing the NF-kB promoter element are used to screen
supernatants for such activity.

Additionally, when tested against Jurkat cell lines, supernatants removed from
cells containing this gene activated the GAS assay. Thus, it is likely that this gene
activates Jurkat cells, and to a lesser extent other immune cells, through the Jak-STAT
signal transduction pathway. The gamma activating sequence (GAS) is a
promoter element found upstream of many genes which are involved in the Jak-STAT
pathway. The Jak-STAT pathway is a large signal transduction pathway involved in
the differentiation and proliferation of cells. Therefore, activation of the Jak-STAT
pathway, reflected by the binding of the GAS element, can be used to indicate proteins
involved in the proliferation and differentiation of cells.

Likewise, when tested against K562 leukemia cell lines, supernatants removed
from cells containing this gene activated the ISRE assay. Thus, it is likely that this gene
activates leukemia cells, and to a lesser extent other cells, through the Jak-STAT signal
transduction pathway. The interferon-sensitive response element (ISRE) is a promoter element
found upstream of many genes which are involved in the Jak-STAT pathway. The Jak-STAT
pathway is a large signal transduction pathway involved in the differentiation
and proliferation of cells. Therefore, activation of the Jak-STAT pathway, reflected by
the binding of the ISRE element, can be used to indicate proteins involved in the
proliferation and differentiation of cells.

This gene is expressed primarily in fetal and infant brain tissues.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions which include, but are
not limited to, color blindness, light sensitivity and neurological disorders. Similarly,
polypeptides and antibodies directed to these polypeptides are useful in providing
immunological probes for differential identification of the tissue(s) or cell type(s). For a
number of disorders of the above tissues or cells, particularly of the optic and
neurological systems, expression of this gene at significantly higher or lower levels
may be routinely detected in certain tissues or cell types (e.g., optic, neural, cancerous
and wounded tissues) or bodily fluids (e.g., lymph, serum, plasma, urine, synovial
fluid and spinal fluid) or another tissue or cell sample taken from an individual having
such a disorder, relative to the standard gene expression level, i.e., the expression level
in healthy tissue or bodily fluid from an individual not having the disorder.

The tissue distribution in fetal and infant brain tissues, and the homology to
the retinal potassium-dependent sodium-calcium exchanger gene, indicate that
polynucleotides and polypeptides corresponding to this gene are useful for the
diagnosis and/or treatment of various optic disorders related to light adaptation in rod
photoreceptors, such as color blindness and light sensitivity. More generally, the tissue
distribution in brain tissues indicates that polynucleotides and polypeptides
corresponding to this gene are useful for the detection/treatment of neurodegenerative
disease states and behavioural disorders such as Alzheimer's Disease, Parkinson's
Disease, Huntington's Disease, Tourette Syndrome, schizophrenia, mania, dementia,
paranoia, obsessive compulsive disorder, panic disorder, learning disabilities, ALS,
psychoses, autism, and altered behaviors, including disorders in feeding, sleep
patterns, balance, and perception.

In addition, the gene or gene product may also play a role in the treatment and/or
detection of developmental disorders associated with the developing embryo, or
sexually-linked disorders. The protein, as well as antibodies directed against the protein,
may show utility as a tumor marker and/or immunotherapy target for the above-listed
tissues.

Many polynucleotide sequences, such as EST sequences, are publicly available 
and accessible through sequence databases. Some of these sequences are related to SEQ 
ID NO: 17 and may have been publicly available prior to conception of the present
invention. Preferably, such related polynucleotides are specifically excluded from the
scope of the present invention. To list every related sequence is cumbersome.
Accordingly, preferably excluded from the present invention are one or more
polynucleotides comprising a nucleotide sequence described by the general formula of
a-b, where a is any integer between 1 to 1475 of SEQ ID NO: 17, b is an integer of 15
to 1489, where both a and b correspond to the positions of nucleotide residues shown
in SEQ ID NO: 17, and where b is greater than or equal to a + 14.

FEATURES OF PROTEIN ENCODED BY GENE NO: 8 

The gene encoding the disclosed cDNA is thought to reside on chromosome 17. 
Accordingly, polynucleotides related to this invention are useful as a marker in linkage
analysis for chromosome 17.

This gene is expressed primarily in placental tissue, and to a lesser extent in
breast tissue and melanocytes.

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, breast cancer and melanoma. Similarly, polypeptides and antibodies 
directed to these polypeptides are useful in providing immunological probes for
differential identification of the tissue(s) or cell type(s). For a number of disorders of
the above tissues or cells, particularly of the immune, metabolic and integumental
systems, expression of this gene at significantly higher or lower levels may be routinely
detected in certain tissues or cell types (e.g., immune, metabolic, integumentary,
cancerous and wounded tissues) or bodily fluids (e.g., lymph, serum, plasma, urine,
synovial fluid and spinal fluid) or another tissue or cell sample taken from an individual 
having such a disorder, relative to the standard gene expression level, i.e., the 
expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. 

The tissue distribution in placental and breast tissues indicates that 
polynucleotides and polypeptides corresponding to this gene are useful for the 
diagnosis and/or treatment of certain cancers, including breast cancer and melanomas. 
Expression within embryonic tissue and other cellular sources marked by proliferating 
cells indicates that this protein may play a role in the regulation of cellular division, and 
may show utility in the diagnosis and treatment of cancer and other proliferative 
disorders. Similarly, embryonic development also involves decisions involving cell 
differentiation and/or apoptosis in pattern formation. Thus, this protein may also be 
involved in apoptosis or tissue differentiation and could again be useful in cancer
therapy. The protein, as well as antibodies directed against the protein, may show utility as
a tumor marker and/or immunotherapy target for the above-listed tissues.

Many polynucleotide sequences, such as EST sequences, are publicly available 
and accessible through sequence databases. Some of these sequences are related to SEQ 
ID NO: 18 and may have been publicly available prior to conception of the present 
invention. Preferably, such related polynucleotides are specifically excluded from the 
scope of the present invention. To list every related sequence is cumbersome. 
Accordingly, preferably excluded from the present invention are one or more 
polynucleotides comprising a nucleotide sequence described by the general formula of 
a-b, where a is any integer between 1 to 1926 of SEQ ID NO: 18, b is an integer of 15 
to 1940, where both a and b correspond to the positions of nucleotide residues shown
in SEQ ID NO: 18, and where b is greater than or equal to a + 14.

FEATURES OF PROTEIN ENCODED BY GENE NO: 9

The translation product of this gene shares a very small block of sequence
homology with human hemopoietic cell protein-tyrosine kinase (HCK). The hck gene
encodes a 505-residue polypeptide that is closely related to pp56lck, a
lymphocyte-specific protein-tyrosine kinase. The exon breakpoints of the hck gene, partially defined
by using murine genomic genes, demonstrate that hck is a member of the src gene
family and has been subjected to strong selection pressure during mammalian evolution.
High-level expression of hck transcripts in granulocytes is especially provocative since
these cells are terminally differentiated and typically survive in vivo for only a few
hours.

Thus the hck gene, like other members of the src gene family, appears to
function primarily in cells with little growth potential. The translation product of this
gene is expected to share certain biological activities with HCK based on the sequence
similarity between the proteins. The gene encoding the disclosed cDNA is thought to
reside on chromosome 20. Accordingly, polynucleotides related to this invention are
useful as a marker in linkage analysis for chromosome 20.

This gene is expressed primarily in human prostate cancer, and to a lesser extent
in activated neutrophils and primary dendritic cells.

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, prostate cancer; hematopoietic disorders; immune dysfunction;
susceptibility to infection; and inflammation. Similarly, polypeptides and antibodies
directed to these polypeptides are useful in providing immunological probes for
differential identification of the tissue(s) or cell type(s). For a number of disorders of
the above tissues or cells, particularly of the prostate and/or immune system, expression
of this gene at significantly higher or lower levels may be routinely detected in certain
tissues or cell types (e.g., gastrointestinal, immune, cancerous and wounded tissues) or
bodily fluids (e.g., lymph, serum, plasma, urine, synovial fluid and spinal fluid) or 
another tissue or cell sample taken from an individual having such a disorder, relative to 
the standard gene expression level, i.e., the expression level in healthy tissue or bodily 
fluid from an individual not having the disorder. 

The tissue distribution in prostate cancer tissue, dendritic cells and neutrophils,
and the short block of homology to hck, indicate that polynucleotides and polypeptides
corresponding to this gene are useful for the diagnosis and/or treatment of prostate
cancer, as well as disorders of the immune system. For example, this gene product is
thought to play a role in the abnormal cellular proliferation that accompanies prostate
cancer. Inhibitors of the action of this gene product have beneficial therapeutic
application in the treatment of prostate cancer. Alternately, the expression in neutrophils
and dendritic cells indicates that this gene product may play a role in the survival,
proliferation, and/or differentiation of hematopoietic cells, and may play key roles in
inflammation and immunity. The protein, as well as antibodies directed against the protein,
may show utility as a tumor marker and/or immunotherapy target for the above-listed
tissues.

WO 99/46289 PCT/US99/05721

Many polynucleotide sequences, such as EST sequences, are publicly available 
and accessible through sequence databases. Some of these sequences are related to SEQ 
ID NO: 19 and may have been publicly available prior to conception of the present 
invention. Preferably, such related polynucleotides are specifically excluded from the 
scope of the present invention. To list every related sequence is cumbersome. 
Accordingly, preferably excluded from the present invention are one or more 
polynucleotides comprising a nucleotide sequence described by the general formula of 
a-b, where a is any integer between 1 to 1578 of SEQ ID NO: 19, b is an integer of 15 
to 1592, where both a and b correspond to the positions of nucleotide residues shown 
in SEQ ID NO: 19, and where b is greater than or equal to a + 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 10 

The gene encoding the disclosed cDNA is thought to reside on chromosome 13. 
Accordingly, polynucleotides related to this invention are useful as a marker in linkage 
analysis for chromosome 13. 

This gene is expressed primarily in primary dendritic cells.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, immune disorders; defects in immunity; susceptibility to infections; and 
hematopoietic disorders. Similarly, polypeptides and antibodies directed to these 
polypeptides are useful in providing immunological probes for differential identification 
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the immune system, expression of this gene at significantly higher or 
lower levels may be routinely detected in certain tissues or cell types (e.g., immune, 
cancerous and wounded tissues) or bodily fluids (e.g., lymph, serum, plasma, urine, 
synovial fluid and spinal fluid) or another tissue or cell sample taken from an individual 
having such a disorder, relative to the standard gene expression level, i.e., the
expression level in healthy tissue or bodily fluid from an individual not having the
disorder. 

Preferred epitopes include those comprising a sequence shown in SEQ ID NO: 
53 as residues: Glu-35 to Lys-44, Cys-83 to Gly-88.

The tissue distribution in primary dendritic cells indicates that protein products
of this gene are useful for the diagnosis and/or treatment of a variety of immune
disorders. Expression of this gene product by dendritic cells indicates that it may play a
role in the immune recognition/presentation process and may therefore be involved in
the regulation of immunity. Alternately, it may represent a protein that is produced by
cells, such as dendritic cells, that then has an effect on other hematopoietic cell types,
and thereby regulates their survival, proliferation, activation, and/or differentiation.
Therefore this gene product may have therapeutic benefit in a variety of hematopoietic
disorders. Furthermore, expression of this gene product in primary dendritic cells also
strongly indicates a role for this protein in immune function and immune surveillance.
The protein, as well as antibodies directed against the protein, may show utility as a tumor
marker and/or immunotherapy target for the above-listed tissues.

Many polynucleotide sequences, such as EST sequences, are publicly available 
and accessible through sequence databases. Some of these sequences are related to SEQ
ID NO:20 and may have been publicly available prior to conception of the present
invention. Preferably, such related polynucleotides are specifically excluded from the
scope of the present invention. To list every related sequence is cumbersome.
Accordingly, preferably excluded from the present invention are one or more
polynucleotides comprising a nucleotide sequence described by the general formula of
a-b, where a is any integer between 1 to 1396 of SEQ ID NO:20, b is an integer of 15
to 1410, where both a and b correspond to the positions of nucleotide residues shown
in SEQ ID NO:20, and where b is greater than or equal to a + 14.

FEATURES OF PROTEIN ENCODED BY GENE NO: 11 

This gene is expressed primarily in primary dendritic cells. 

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions which include, but are
not limited to, hematopoietic disorders; immune dysfunction; impaired immunity; and
susceptibility to infections. Similarly, polypeptides and antibodies directed to these
polypeptides are useful in providing immunological probes for differential identification
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells,
particularly of the immune system, expression of this gene at significantly higher or
lower levels may be routinely detected in certain tissues or cell types (e.g., immune,
cancerous and wounded tissues) or bodily fluids (e.g., lymph, serum, plasma, urine,
synovial fluid and spinal fluid) or another tissue or cell sample taken from an individual
having such a disorder, relative to the standard gene expression level, i.e., the
expression level in healthy tissue or bodily fluid from an individual not having the
disorder.

Preferred epitopes include those comprising a sequence shown in SEQ ID NO: 
54 as residues: Ala-107 to Ser-112.

The tissue distribution in primary dendritic cells indicates that protein products 
of this gene are useful for the diagnosis and/or treatment of hematopoietic disorders. 
Expression of this gene product specifically in primary dendritic cells indicates that it 
may play a role in immune responses. Therefore, it may have clinical utility in a variety 
of disorders that are characterized by impaired immune function or susceptibility to 
infection. Alternately, it may represent a gene product that is produced by specific cells, 
such as dendritic cells, that has effects on the survival, activation, proliferation, and/or 
differentiation of other cell types, most notably other hematopoietic cell lineages. 
Therefore, the gene product may have clinical utility in the treatment of a variety of 
hematopoietic disorders. Furthermore, expression of this gene product in primary 
dendritic cells also strongly indicates a role for this protein in immune function and 
immune surveillance. The protein, as well as antibodies directed against the protein, may
show utility as a tumor marker and/or immunotherapy target for the above-listed
tissues.

Many polynucleotide sequences, such as EST sequences, are publicly available 
and accessible through sequence databases. Some of these sequences are related to SEQ 
ID NO:21 and may have been publicly available prior to conception of the present 
invention. Preferably, such related polynucleotides are specifically excluded from the 
scope of the present invention. To list every related sequence is cumbersome. 
Accordingly, preferably excluded from the present invention are one or more 
polynucleotides comprising a nucleotide sequence described by the general formula of 
a-b, where a is any integer between 1 to 1713 of SEQ ID NO:21, b is an integer of 15 
to 1727, where both a and b correspond to the positions of nucleotide residues shown 
in SEQ ID NO:21, and where b is greater than or equal to a + 14.

FEATURES OF PROTEIN ENCODED BY GENE NO: 12

This gene is expressed primarily in primary dendritic cells.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions which include, but are
not limited to, hematopoietic disorders; immune dysfunction; susceptibility to infection;
and inflammation. Similarly, polypeptides and antibodies directed to these polypeptides
are useful in providing immunological probes for differential identification of the
tissue(s) or cell type(s). For a number of disorders of the above tissues or cells,
particularly of the immune system, expression of this gene at significantly higher or
lower levels may be routinely detected in certain tissues or cell types (e.g., immune,
cancerous and wounded tissues) or bodily fluids (e.g., lymph, serum, plasma, urine,
synovial fluid and spinal fluid) or another tissue or cell sample taken from an individual
having such a disorder, relative to the standard gene expression level, i.e., the
expression level in healthy tissue or bodily fluid from an individual not having the
disorder.

Preferred epitopes include those comprising a sequence shown in SEQ ID NO: 
55 as residues: Ser-106 to Leu-113.

The tissue distribution in primary dendritic cells indicates that the protein
products of this gene are useful for the diagnosis and/or treatment of immune disorders.
Expression of this gene specifically in dendritic cells indicates a role in immune
function. Therefore, this gene product may be clinically useful in disorders marked by
impaired or altered immune function, such as susceptibility to infection or impaired
immune surveillance. Alternately, this may represent a gene product that is produced by
cells such as dendritic cells that has an effect on the survival, proliferation, activation,
and/or differentiation of other cell types, most notably other hematopoietic cells.
Therefore, it may have clinical utility in treating a broad range of hematopoietic
disorders and in increasing stem cell numbers. Furthermore, expression of this gene
product in primary dendritic cells also strongly indicates a role for this protein in
immune function and immune surveillance. The protein, as well as antibodies directed
against the protein, may show utility as a tumor marker and/or immunotherapy target
for the above-listed tissues.

Many polynucleotide sequences, such as EST sequences, are publicly available
and accessible through sequence databases. Some of these sequences are related to SEQ
ID NO:22 and may have been publicly available prior to conception of the present
invention. Preferably, such related polynucleotides are specifically excluded from the
scope of the present invention. To list every related sequence is cumbersome.
Accordingly, preferably excluded from the present invention are one or more
polynucleotides comprising a nucleotide sequence described by the general formula of
a-b, where a is any integer between 1 to 1204 of SEQ ID NO:22, b is an integer of 15
to 1218, where both a and b correspond to the positions of nucleotide residues shown
in SEQ ID NO:22, and where b is greater than or equal to a + 14.

FEATURES OF PROTEIN ENCODED BY GENE NO: 13

The gene encoding the disclosed cDNA is thought to reside on chromosome 1.
Accordingly, polynucleotides related to this invention are useful as a marker in linkage
analysis for chromosome 1.

This gene is expressed primarily in fetal liver and spleen tissues, and to a lesser
extent in breast tissue and Hodgkin's lymphoma.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions which include, but are
not limited to, immune dysfunction; hematopoietic disorders; breast cancer; and
Hodgkin's lymphoma. Similarly, polypeptides and antibodies directed to these
polypeptides are useful in providing immunological probes for differential identification
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells,
particularly of the immune system and/or breast, expression of this gene at significantly
higher or lower levels may be routinely detected in certain tissues or cell types (e.g.,
breast, immune, cancerous and wounded tissues) or bodily fluids (e.g., lymph, serum,
plasma, urine, synovial fluid and spinal fluid) or another tissue or cell sample taken
from an individual having such a disorder, relative to the standard gene expression
level, i.e., the expression level in healthy tissue or bodily fluid from an individual not
having the disorder.

Preferred epitopes include those comprising a sequence shown in SEQ ID NO: 
56 as residues: Tyr-41 to Pro-46.

The tissue distribution in fetal liver/spleen tissue, breast tissue, and Hodgkin's 
lymphoma, indicates that the protein products of this gene are useful for the diagnosis 
and/or treatment of a variety of hematopoietic disorders, including Hodgkin's 
lymphoma, as well as disorders of the breast, most notably breast cancer, as well as
cancers of other tissues where expression has been observed. Expression of this gene
product in hematopoietic tissues, particularly tissues involved in hematopoiesis such as
fetal liver, suggests that it may play a role in the survival, proliferation, activation, and/or
differentiation of hematopoietic lineages. Particularly, expression in Hodgkin's
lymphoma indicates that it may be involved in proliferation and/or transformation,
suggesting that it may also contribute to a variety of cancer processes. Expression in the
breast indicates that it may be involved in normal breast function, in breast cancer, as a
vital nutrient to infants during lactation, or may reflect expression within the lymph
nodes of the breast. The protein, as well as antibodies directed against the protein, may
show utility as a tumor marker and/or immunotherapy target for the above-listed
tissues.

Many polynucleotide sequences, such as EST sequences, are publicly available 
and accessible through sequence databases. Some of these sequences are related to SEQ
ID NO:23 and may have been publicly available prior to conception of the present
invention. Preferably, such related polynucleotides are specifically excluded from the
scope of the present invention. To list every related sequence is cumbersome.
Accordingly, preferably excluded from the present invention are one or more
polynucleotides comprising a nucleotide sequence described by the general formula of
a-b, where a is any integer between 1 to 698 of SEQ ID NO:23, b is an integer of 15 to
712, where both a and b correspond to the positions of nucleotide residues shown in 
SEQ ID NO:23, and where b is greater than or equal to a + 14. 
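
The general formula a-b stated above can be read as a simple test on a pair of nucleotide positions. The following Python sketch is offered for illustration only and is not part of the disclosure: the names `is_excluded_fragment` and `count_excluded_fragments` are hypothetical, and the sketch assumes that positions are 1-based and that all stated ranges are inclusive. The upper bound of b (712 in the case of SEQ ID NO:23) is passed in as `max_b`.

```python
MIN_LENGTH = 15  # b >= a + 14 means each fragment spans at least 15 positions


def is_excluded_fragment(a: int, b: int, max_b: int) -> bool:
    """Return True if the fragment a-b satisfies the general formula.

    Assumption (not stated in the text): positions are 1-based and inclusive,
    so a lies in [1, max_b - 14], b lies in [15, max_b], and b >= a + 14.
    """
    max_a = max_b - (MIN_LENGTH - 1)
    return (
        1 <= a <= max_a
        and MIN_LENGTH <= b <= max_b
        and b >= a + (MIN_LENGTH - 1)
    )


def count_excluded_fragments(max_b: int) -> int:
    """Count every (a, b) pair that satisfies the formula for a given max_b."""
    max_a = max_b - (MIN_LENGTH - 1)
    # For a fixed a, b may take any value from a + 14 up to max_b.
    return sum(max_b - (a + MIN_LENGTH - 1) + 1 for a in range(1, max_a + 1))


if __name__ == "__main__":
    # Bounds quoted for SEQ ID NO:23: a from 1 to 698, b from 15 to 712.
    print(is_excluded_fragment(1, 15, 712))     # shortest fragment at the 5' end
    print(is_excluded_fragment(698, 712, 712))  # shortest fragment at the 3' end
    print(is_excluded_fragment(690, 700, 712))  # only 11 positions long
    print(count_excluded_fragments(712))
```

The same two functions apply to the other sequences described herein by substituting the corresponding upper bound of b.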

FEATURES OF PROTEIN ENCODED BY GENE NO: 14

The gene encoding the disclosed cDNA is thought to reside on chromosome 8. 
Accordingly, polynucleotides related to this invention are useful as a marker in linkage 
analysis for chromosome 8.

This gene is expressed primarily in infant brain tissue, and to a lesser extent in
T-cells.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions which include, but are
not limited to, neurodegenerative disorders. Similarly, polypeptides and antibodies
directed to these polypeptides are useful in providing immunological probes for
differential identification of the tissue(s) or cell type(s). For a number of disorders of
the above tissues or cells, particularly of the central nervous system (CNS), expression
of this gene at significantly higher or lower levels may be routinely detected in certain
tissues or cell types (e.g., neural, cancerous and wounded tissues) or bodily fluids
(e.g., lymph, serum, plasma, urine, synovial fluid and spinal fluid) or another tissue or
cell sample taken from an individual having such a disorder, relative to the standard
gene expression level, i.e., the expression level in healthy tissue or bodily fluid from an
individual not having the disorder.

Preferred epitopes include those comprising a sequence shown in SEQ ID NO:
57 as residues: Ala-67 to Glu-72, Thr-91 to Ile-100.

The tissue distribution in infant brain tissue indicates that polynucleotides and
polypeptides corresponding to this gene are useful for the detection/treatment of
neurodegenerative disease states and behavioural disorders such as Alzheimer's disease,
Parkinson's disease, Huntington's disease, Tourette syndrome, schizophrenia, mania,
dementia, paranoia, obsessive compulsive disorder, panic disorder, learning
disabilities, ALS, psychoses, autism, and altered behaviors, including disorders in
feeding, sleep patterns, balance, and perception.

In addition, the gene or gene product may also play a role in the treatment and/or
detection of developmental disorders associated with the developing embryo, or
sexually-linked disorders. The protein, as well as antibodies directed against the protein,
may show utility as a tumor marker and/or immunotherapy target for the above listed
tissues.

Many polynucleotide sequences, such as EST sequences, are publicly available
and accessible through sequence databases. Some of these sequences are related to SEQ
ID NO:24 and may have been publicly available prior to conception of the present
invention. Preferably, such related polynucleotides are specifically excluded from the
scope of the present invention. To list every related sequence would be cumbersome.
Accordingly, preferably excluded from the present invention are one or more
polynucleotides comprising a nucleotide sequence described by the general formula of
a-b, where a is any integer from 1 to 1408 of SEQ ID NO:24, b is an integer from 15
to 1422, where both a and b correspond to the positions of nucleotide residues shown
in SEQ ID NO:24, and where b is greater than or equal to a + 14.

FEATURES OF PROTEIN ENCODED BY GENE NO: 15 

The translation product of this gene shares sequence homology with 
phosphatidylethanolamine N-methyltransferase (isolated from rat) which is thought to 
be important in catalyzing the synthesis of phosphatidylcholine from 
phosphatidylethanolamine in hepatocytes (See Genbank Accession No.: g310195 and 
J. Biol. Chem. 268 (22), 16655-16663 (1993)). Based on the sequence similarity 
between rat phosphatidylethanolamine N-methyltransferase and the translation product 
of this gene, the two proteins are expected to share certain biological activities. 

In specific embodiments, polypeptides of the invention comprise the following 
amino acid sequences: GGPRMKRSGNPGAEVTNSSVAGPDCCGGLGNIDFRQA
DFCVMTRmSYVDPLDPSFVAAXOTITFNPLYWNVVARWEHKTRKLSRAFGSP 
YLACYSLSXTILLLNFLRSHCFTQA (SEQ ID NO:93); GGPRMKRSGNPGAEVT 
NSSVAGPDCCGGLGNIDFRQADFCVMTRLLG YVDP (SEQ ID NO:94); and/or 

lX>PSFVAAVnTrFNPLYWNVVARWEHKTRKl^RAFGSPYLACYSI^XTILL 
LNFLRSHCFTQA (SEQ ID NO:96). Polynucleotides encoding these polypeptides are
also encompassed by the invention. The gene encoding the disclosed cDNA is thought
to reside on chromosome 17. Accordingly, polynucleotides related to this invention are
useful as a marker in linkage analysis for chromosome 17.

This gene is expressed primarily in liver cells, and to a lesser extent in placental
tissue.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions which include, but are
not limited to, liver failure and liver metabolic disorders. Similarly, polypeptides and
antibodies directed to these polypeptides are useful in providing immunological probes
for differential identification of the tissue(s) or cell type(s). For a number of disorders
of the above tissues or cells, particularly of the endocrine and hepatic systems,
expression of this gene at significantly higher or lower levels may be routinely detected
in certain tissues or cell types (e.g., liver, cancerous and wounded tissues) or bodily
fluids (e.g., lymph, serum, plasma, urine, synovial fluid and spinal fluid) or another
tissue or cell sample taken from an individual having such a disorder, relative to the
standard gene expression level, i.e., the expression level in healthy tissue from an
individual not having the disorder.

Preferred epitopes include those comprising a sequence shown in SEQ ID NO:
58 as residues: Pro-5 to Leu-10.

The tissue distribution in liver tissue, and the homology to
phosphatidylethanolamine N-methyltransferase, indicate that the protein products of
this gene are useful for the treatment and/or diagnosis of diseases of the liver, and
cancers (e.g., hepatoblastoma, jaundice, hepatitis, liver metabolic diseases and
conditions that are attributable to the differentiation of hepatocyte progenitor cells).
The protein, as well as antibodies directed against the protein, may show utility as a
tumor marker and/or immunotherapy target for the above listed tissues.

Many polynucleotide sequences, such as EST sequences, are publicly available
and accessible through sequence databases. Some of these sequences are related to SEQ
ID NO:25 and may have been publicly available prior to conception of the present
invention. Preferably, such related polynucleotides are specifically excluded from the
scope of the present invention. To list every related sequence would be cumbersome.
Accordingly, preferably excluded from the present invention are one or more
polynucleotides comprising a nucleotide sequence described by the general formula of
a-b, where a is any integer from 1 to 1024 of SEQ ID NO:25, b is an integer from 15
to 1038, where both a and b correspond to the positions of nucleotide residues shown
in SEQ ID NO:25, and where b is greater than or equal to a + 14.

FEATURES OF PROTEIN ENCODED BY GENE NO: 16

The translation product of this gene shares sequence homology with heat shock 
protein 90, which is thought to be important in cellular proliferation. In specific 
embodiments, polypeptides of the invention comprise the following amino acid 
sequences: PQRSELAAASNRPCRVCISLLLCLEDRTMPKKAKPTGSGKEEGP 
APCKQMKLEAAGGPSALNFDSPSSLFESLISPIKTETFFKEFWEQKPLLIQRDD 
PALATYYGSLFKLTDLKSLCSRGMYYGRDVNVCRCVNGKKKVLNKDGKAHF 
LQLRKDFDQKRATIQFHQPQRFKDELWRIQEKLECYFGSLVGSNVYTTPADLRA 
CRPIMMMSRFSSCSWRERNTGASTTPLCPWHESTAWRPRKGSAGRCMSLC 
(SEQ ID NO:97); PQRSELAAASNRPCRVCISLLLCLEDRTMPKKAKFTGSGKEE 
GP (SEQ ID NO:98); APCKQMKLEAAGGPSALNFDSPSSLFESLISPIKTETFFKE 
FWEQ (SEQ ID NO:99); KPLLIQRDDPALATYYGSLFKLTDLKSLCSRGMYYGR 
DVNVCRC (SEQ ID NO:100); VNGKKKVLNKDGKAHFLQLRKDFDQKRATIQF
HQPQRFKDELWRI (SEQ ID NO:101); QEKLECYFGSLVGSNVYITPADLRACRPI
MMMSRFSSCSWRERN (SEQ ID NO:102); and/or TGASTTPLCPWHESTAWRPR
KGSAGRCMSLC (SEQ ID NO:103). Polynucleotides encoding these polypeptides are
also encompassed by the invention. The gene encoding the disclosed cDNA is thought
to reside on chromosome 3. Accordingly, polynucleotides related to this invention are
useful as a marker in linkage analysis for chromosome 3.

This gene is expressed primarily in placental tissue, and to a lesser extent in 
melanocytes. 

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions which include, but are
not limited to, systemic lupus erythematosus and other autoimmune diseases, acute
leukemia, and developmental disorders. Similarly, polypeptides and antibodies directed
to these polypeptides are useful to provide immunological probes for differential
identification of the tissue(s) or cell type(s). For a number of disorders of the above
tissues or cells, particularly of the immune and developing systems, expression of this
gene at significantly higher or lower levels may be routinely detected in certain tissues
or cell types (e.g., immune, developing, cancerous and wounded tissues) or bodily
fluids (e.g., lymph, serum, plasma, urine, synovial fluid and spinal fluid) or another
tissue or cell sample taken from an individual having such a disorder, relative to the
standard gene expression level, i.e., the expression level in healthy tissue or bodily
fluid from an individual not having the disorder.

Preferred epitopes include those comprising a sequence shown in SEQ ID NO:
59 as residues: His-13 to Leu-21, Glu-36 to Tyr-44, Thr-103 to Trp-109.
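
Residue ranges written in this style can be read mechanically. The following Python sketch is illustrative only and is not part of the disclosure: `EpitopeRange` and `parse_epitope_ranges` are hypothetical names, and the sketch assumes 1-based, inclusive residue numbering. It parses only the range notation and does not use or verify the underlying amino acid sequence.

```python
import re
from typing import List, NamedTuple


class EpitopeRange(NamedTuple):
    start_residue: str  # three-letter amino acid code, e.g. "His"
    start: int          # 1-based position of the first residue
    end_residue: str    # three-letter code of the last residue
    end: int            # 1-based position of the last residue (inclusive)

    @property
    def length(self) -> int:
        """Number of residues spanned, counting both endpoints."""
        return self.end - self.start + 1


# Matches e.g. "His-13 to Leu-21"; tolerates stray spaces after the hyphen.
_RANGE = re.compile(r"([A-Z][a-z]{2})-\s*(\d+)\s+to\s+([A-Z][a-z]{2})-\s*(\d+)")


def parse_epitope_ranges(text: str) -> List[EpitopeRange]:
    """Extract every 'Xxx-N to Yyy-M' range found in the given text."""
    ranges = []
    for m in _RANGE.finditer(text):
        r = EpitopeRange(m.group(1), int(m.group(2)), m.group(3), int(m.group(4)))
        if r.end < r.start:
            raise ValueError(f"range ends before it starts: {m.group(0)}")
        ranges.append(r)
    return ranges


if __name__ == "__main__":
    listed = "His-13 to Leu-21, Glu-36 to Tyr-44, Thr-103 to Trp-109"
    for r in parse_epitope_ranges(listed):
        print(r.start_residue, r.start, r.end_residue, r.end, r.length)
```

Given the full polypeptide as a string, a range could then be sliced as `sequence[r.start - 1:r.end]`, again under the assumption of 1-based inclusive numbering.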

The tissue distribution in placental tissue, and the homology to heat shock
protein 90, indicate that the protein products of this gene are useful for the treatment
and/or diagnosis of systemic lupus erythematosus (SLE), since overexpression of this
protein, its surface localization, and auto-antibodies to it have been observed in SLE.
More generally, the tissue distribution in placental tissue indicates that
polynucleotides and polypeptides corresponding to this gene are useful for the
diagnosis and/or treatment of disorders of the placenta. Specific expression within the
placenta indicates that this gene product may play a role in the proper establishment and
maintenance of placental function.

Alternatively, this gene product may be produced by the placenta and then
transported to the embryo, where it may play a crucial role in the development and/or
survival of the developing embryo or fetus. Expression of this gene product in a
vascular-rich tissue such as the placenta also indicates that this gene product may be
produced more generally in endothelial cells or within the circulation. In such instances,
it may play more generalized roles in vascular function, such as in angiogenesis. It may
also be produced in the vasculature and have effects on other cells within the
circulation, such as hematopoietic cells. It may serve to promote the proliferation,
survival, activation, and/or differentiation of hematopoietic cells, as well as other cells
throughout the body. The protein, as well as antibodies directed against the protein, may
show utility as a tumor marker and/or immunotherapy target for the above listed
tissues.

Many polynucleotide sequences, such as EST sequences, are publicly available
and accessible through sequence databases. Some of these sequences are related to SEQ
ID NO:26 and may have been publicly available prior to conception of the present
invention. Preferably, such related polynucleotides are specifically excluded from the
scope of the present invention. To list every related sequence would be cumbersome.
Accordingly, preferably excluded from the present invention are one or more
polynucleotides comprising a nucleotide sequence described by the general formula of
a-b, where a is any integer from 1 to 1892 of SEQ ID NO:26, b is an integer from 15
to 1906, where both a and b correspond to the positions of nucleotide residues shown
in SEQ ID NO:26, and where b is greater than or equal to a + 14.

FEATURES OF PROTEIN ENCODED BY GENE NO: 17

The translation product of this gene shares sequence homology with
prostaglandin D synthetase, which is thought to be important in blood-tissue barriers.
In specific embodiments, polypeptides of the invention comprise the following amino
acid sequence: GGGIHRLHNGALQLRVLQRVEHLHLLHHAVKHICTASLPVLHG
FIAAQCRPGX (SEQ ID NO:104). Polynucleotides encoding these polypeptides are
also encompassed by the invention.

This gene is expressed primarily in epididymis tissue.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions which include, but are
not limited to, multiple sclerosis, Meckel syndrome, polycystic kidney disease, and
reproductive disorders. Similarly, polypeptides and antibodies directed to these
polypeptides are useful in providing immunological probes for differential identification
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells,
particularly of the nervous, reproductive, and renal systems, expression of this gene at
significantly higher or lower levels may be routinely detected in certain tissues or cell
types (e.g., neural, renal, reproductive, cancerous and wounded tissues) or bodily
fluids (e.g., lymph, serum, plasma, urine, synovial fluid and spinal fluid) or another
tissue or cell sample taken from an individual having such a disorder, relative to the
standard gene expression level, i.e., the expression level in healthy tissue or bodily
fluid from an individual not having the disorder.

The tissue distribution in epididymis tissue, and the homology to prostaglandin
D synthetase, indicate that the protein products of this gene are useful for the treatment
and/or diagnosis of diseases related to the blood-tissue, blood-cerebrospinal fluid,
blood-retina, blood-aqueous humor, and blood-testis barriers. More generally, the
protein product of this gene, based upon its tissue distribution, is useful for the
detection and/or treatment of male reproductive disorders concerning dysfunction of the
epididymis. The protein, as well as antibodies directed against the protein, may show
utility as a tumor marker and/or immunotherapy target for the above listed tissues.

Many polynucleotide sequences, such as EST sequences, are publicly available
and accessible through sequence databases. Some of these sequences are related to SEQ
ID NO:27 and may have been publicly available prior to conception of the present
invention. Preferably, such related polynucleotides are specifically excluded from the
scope of the present invention. To list every related sequence would be cumbersome.
Accordingly, preferably excluded from the present invention are one or more
polynucleotides comprising a nucleotide sequence described by the general formula of
a-b, where a is any integer from 1 to 833 of SEQ ID NO:27, b is an integer from 15 to
847, where both a and b correspond to the positions of nucleotide residues shown in
SEQ ID NO:27, and where b is greater than or equal to a + 14.

FEATURES OF PROTEIN ENCODED BY GENE NO: 18

The translation product of this gene shares sequence homology with fructose
transporter protein and other sugar transporter proteins. Based on the sequence
similarity to other sugar transporter proteins, the translation product of this gene is
expected to share certain biological activities with these proteins, such as sugar transport
activities. Such activities can be assayed by methods known to those of skill in the art.

When tested against fibroblast cell lines, supernatants removed from cells
containing this gene activated the EGR1 (early growth response gene 1) promoter
element. Thus, it is likely that this gene activates fibroblast cells and, to a lesser extent,
integumentary cells and tissues, through the EGR1 signal transduction pathway.
EGR1 is a separate signal transduction pathway from Jak-STAT. Genes containing the
EGR1 promoter are induced in various tissues and cell types upon activation, leading
the cells to undergo differentiation and proliferation.

In specific embodiments, polypeptides of the invention comprise the following
amino acid sequence: WDRWSDSALRRLRGSGDLAGELEELEEERAACQGCRAR
RPWELFQHRALRRQVTSLVVLGSAMELCGNDSVYAYASSVFRKAGVPEAKIQY
AIIGTGSCELLTAVVSVSLEGALPPPALWGGTPRSSALNQFTLQKKKKKKKKK
KKKKKKK (SEQ ID NO:105); RRLRGSGDLAGELEELEEERAACQGCRARRPW
ELFQH (SEQ ID NO:106); RQVTSLVVLGSAMELCGNDSVYAYASSVF (SEQ ID
NO:107); and/or TGSCELLTAVVSVSLEGALPPPALWGGTPRSSAL (SEQ ID
NO:108). Polynucleotides encoding these polypeptides are also encompassed by the
invention.

This gene is expressed primarily in endometrial stromal cells. 

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions which include, but are
not limited to, reproductive and metabolic diseases and/or disorders, particularly
diabetes. Similarly, polypeptides and antibodies directed to these polypeptides are
useful in providing immunological probes for differential identification of the tissue(s)
or cell type(s). For a number of disorders of the above tissues or cells, particularly of
the endocrine system, expression of this gene at significantly higher or lower levels
may be detected in certain tissues or cell types (e.g., reproductive, metabolic, and/or
cancerous and wounded tissues) or bodily fluids (e.g., lymph, serum, plasma, urine,
synovial fluid and spinal fluid) or another tissue or cell sample taken from an individual
having such a disorder, relative to the standard gene expression level, i.e., the
expression level in healthy tissue or bodily fluid from an individual not having the
disorder.

Preferred epitopes include those comprising a sequence shown in SEQ ID NO:
61 as residues: Phe-45 to Trp-50, Ala-52 to Pro-59, Ser-149 to Leu-154, Gly-219 to
Cys-233.

The homology to sugar transporter proteins (particularly the GLUT5 protein)
indicates that the protein products of this clone are useful for the treatment and/or
diagnosis of sugar metabolism disorders such as diabetes. Further, polynucleotides and
polypeptides of the present invention may be expressed in vivo by administration of the
claimed polynucleotide and polypeptides (see Geneseq T66495-96) for treatment of
diabetes, or expressed in a host cell to prepare a recombinant cell that secretes insulin in
response to glucose and which can be administered to a patient to treat diabetes.
Alternatively, the tissue distribution in endometrial stromal cells, combined with the
detected EGR1 biological activity, suggests the protein is useful for the diagnosis,
treatment, and/or prevention of reproductive and developmental diseases and/or
disorders. The protein is useful in the treatment and/or detection of proliferative
conditions. The protein, as well as antibodies directed against the protein, may show
utility as a tumor marker and/or immunotherapy target for the above listed tissues.

Many polynucleotide sequences, such as EST sequences, are publicly available
and accessible through sequence databases. Some of these sequences are related to SEQ
ID NO:28 and may have been publicly available prior to conception of the present
invention. Preferably, such related polynucleotides are specifically excluded from the
scope of the present invention. To list every related sequence would be cumbersome.
Accordingly, preferably excluded from the present invention are one or more
polynucleotides comprising a nucleotide sequence described by the general formula of
a-b, where a is any integer from 1 to 971 of SEQ ID NO:28, b is an integer from 15 to
985, where both a and b correspond to the positions of nucleotide residues shown in
SEQ ID NO:28, and where b is greater than or equal to a + 14.

FEATURES OF PROTEIN ENCODED BY GENE NO: 19

In specific embodiments, polypeptides of the invention comprise the following 
amino acid sequence: HELRLRKNTVKPSLYRHFKOTLIFAVLASIVFMGWTTK 
TFRIAKCQSDW (SEQ ID NO: 109). Polynucleotides encoding these polypeptides are 
also encompassed by the invention.

This gene is expressed primarily in endometrial tumor tissue, and to a lesser
extent in placental tissue.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions which include, but are
not limited to, developmental and reproductive diseases and/or disorders, particularly
endometrial tumors. Similarly, polypeptides and antibodies directed to these
polypeptides are useful in providing immunological probes for differential identification
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells,
particularly of the reproductive system, expression of this gene at significantly higher or
lower levels may be routinely detected in certain tissues or cell types (e.g.,
developmental, reproductive, and cancerous and wounded tissues) or bodily fluids
(e.g., lymph, serum, plasma, urine, amniotic fluid, synovial fluid and spinal fluid) or
another tissue or cell sample taken from an individual having such a disorder, relative to
the standard gene expression level, i.e., the expression level in healthy tissue or bodily
fluid from an individual not having the disorder.

Preferred epitopes include those comprising a sequence shown in SEQ ID NO:
62 as residues: Pro-27 to Arg-33, Asp-41 to Ile-47, Thr-73 to Asp-85.

The tissue distribution in endometrial tumor tissue and placental tissue indicates
that protein products of this gene are useful for the treatment, diagnosis, and/or
prevention of endometrial tumors, as well as tumors of other tissues where expression
has been observed. Moreover, polynucleotides and polypeptides corresponding to this
gene are useful for the diagnosis, detection, and/or treatment of developmental
disorders.

The relatively specific expression of this gene product in placental tissue and the
endometrium indicates it may be a key player in the proliferation, maintenance, and/or
differentiation of various cell types during development. It may also act as a morphogen
to control cell and tissue type specification. Because of potential roles in proliferation
and differentiation, this gene product may have applications in the adult for tissue
regeneration and the treatment of cancers. Expression within cellular sources marked by
proliferating cells indicates this protein may play a role in the regulation of cellular
division, and may show utility in the diagnosis and treatment of cancer and other
proliferative disorders.

Similarly, developmental tissues rely on decisions involving cell differentiation
and/or apoptosis in pattern formation. Dysregulation of apoptosis can result in
inappropriate suppression of cell death, as occurs in the development of some cancers,
or in failure to control the extent of cell death, as is believed to occur in acquired
immunodeficiency and certain neurodegenerative disorders, such as spinal muscular
atrophy (SMA). Therefore, the polynucleotides and polypeptides of the present
invention are useful in treating, detecting, and/or preventing said disorders and
conditions, in addition to other types of degenerative conditions. Thus this protein may
modulate apoptosis or tissue differentiation and is useful in the detection, treatment,
and/or prevention of degenerative or proliferative conditions and diseases. The protein,
as well as antibodies directed against the protein, may show utility as a tumor marker
and/or immunotherapy target for the above listed tissues.

Many polynucleotide sequences, such as EST sequences, are publicly available
and accessible through sequence databases. Some of these sequences are related to SEQ
ID NO:29 and may have been publicly available prior to conception of the present
invention. Preferably, such related polynucleotides are specifically excluded from the
scope of the present invention. To list every related sequence would be cumbersome.
Accordingly, preferably excluded from the present invention are one or more
polynucleotides comprising a nucleotide sequence described by the general formula of
a-b, where a is any integer from 1 to 900 of SEQ ID NO:29, b is an integer from 15 to
914, where both a and b correspond to the positions of nucleotide residues shown in
SEQ ID NO:29, and where b is greater than or equal to a + 14.

FEATURES OF PROTEIN ENCODED BY GENE NO: 20 

The translation product of this gene was shown to have homology to the
conserved dolichyl-phosphate beta-glucosyltransferase from Saccharomyces cerevisiae
and S. pombe (See Genbank Accession No. gil535141), which is important in protein
trafficking, post-translational processing and modification of proteins, protein secretion,
and stabilizing secreted proteins. Proteins involved in glycosylation events have uses
which are well known in the art, and that supersede those mentioned above.

In specific embodiments, polypeptides of the invention comprise the following
amino acid sequence: WIPRAAGIRHEESIAQRSYFRTLL (SEQ ID NO:110); ADTN
FTQETAMTMITPSSKLTLTKGNKSWSSTAVAAALELVDPPGCRNSARGINCSAF
LLPYSSHVWVPLSGVVPLCQRNQGHTVWVQIIYSRSSFTDVHSTR (SEQ ID
NO:111); MTMITPSSKLTLTKGNKSWSSTAVAA (SEQ ID NO:112); RGINCS
AFLLPYSSHVWVPL (SEQ ID NO:113); and/or VVPLCQRNQGHTVWVQHYSRS
SF (SEQ ID NO:114). Polynucleotides encoding these polypeptides are also
encompassed by the invention.

This gene is expressed primarily in infant brain tissue, and to a lesser extent in 
ovarian cancer tissue. 

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions which include, but are
not limited to, developmental, metabolic, neural, and proliferative diseases and/or
disorders, particularly multiple sclerosis, dementia, and ovarian cancer. Similarly,
polypeptides and antibodies directed to these polypeptides are useful in providing
immunological probes for differential identification of the tissue(s) or cell type(s). For a
number of disorders of the above tissues or cells, particularly of the central nervous
system and reproductive system, expression of this gene at significantly higher or lower
levels may be routinely detected in certain tissues (e.g., developmental, metabolic,
proliferative, and cancerous and wounded tissues) or bodily fluids (e.g., lymph,
serum, plasma, urine, synovial fluid and spinal fluid) or another tissue or cell sample
taken from an individual having such a disorder, relative to the standard gene
expression level, i.e., the expression level in healthy tissue or bodily fluid from an
individual not having the disorder.

Preferred epitopes include those comprising a sequence shown in SEQ ID NO: 
63 as residues: Gly-26 to Gln-32, Pro-42 to Ser-50. 

The tissue distribution in infant brain tissue indicates that the protein products of
this gene are useful for the treatment and/or diagnosis of defects or problems associated
with developmental processes, particularly in the brain. The homology to
dolichyl-phosphate beta-glucosyltransferase from Saccharomyces cerevisiae and S. pombe
indicates that the protein plays a vital role in normal cellular and protein metabolism and
is useful in treating proliferative disorders, in addition to correcting metabolic
deficiencies via gene therapy (i.e., the protein may be required for proper conformation
and stability of a key secreted protein or enzyme, and the stable insertion of the encoding
gene into a stem cell may correct this deficit).



The expression within infant tissue and other cellular sources marked by
proliferating cells indicates this protein may play a role in the regulation of cellular
division, and may show utility in the diagnosis and treatment of cancer and other
proliferative disorders (i.e. may inhibit key cell cycle regulators via inhibition of
endogenous equivalent of present invention). Similarly, developmental tissues rely on
decisions involving cell differentiation and/or apoptosis in pattern formation.
Dysregulation of apoptosis can result in inappropriate suppression of cell death, as
occurs in the development of some cancers, or in failure to control the extent of cell
death, as is believed to occur in acquired immunodeficiency and certain
neurodegenerative disorders, such as spinal muscular atrophy (SMA).

Therefore, the polynucleotides and polypeptides of the present invention are
useful in treating, detecting, and/or preventing said disorders and conditions, in
addition to other types of degenerative conditions. Thus this protein may modulate
apoptosis or tissue differentiation and is useful in the detection, treatment, and/or
prevention of degenerative or proliferative conditions and diseases. The protein, as well
as antibodies directed against the protein, may show utility as a tumor marker and/or
immunotherapy target for the above listed tissues.

Many polynucleotide sequences, such as EST sequences, are publicly available
and accessible through sequence databases. Some of these sequences are related to SEQ
ID NO:30 and may have been publicly available prior to conception of the present
invention. Preferably, such related polynucleotides are specifically excluded from the
scope of the present invention. To list every related sequence would be cumbersome.
Accordingly, preferably excluded from the present invention are one or more
polynucleotides comprising a nucleotide sequence described by the general formula of
a-b, where a is any integer from 1 to 1169 of SEQ ID NO:30, b is an integer from 15
to 1183, where both a and b correspond to the positions of nucleotide residues shown
in SEQ ID NO:30, and where b is greater than or equal to a + 14.

FEATURES OF PROTEIN ENCODED BY GENE NO: 21 

The translation product of this gene shares sequence homology with chicken
ring zinc finger protein, which is thought to be important in the regulation of
transcription. Zinc ring finger proteins have uses well known in the art, and which are
described elsewhere herein. Briefly, the protein may be involved in inter-cellular
communication and proliferation events, leading to migration or differentiation, and
possibly apoptosis and cell death. The protein was subsequently cloned and sequenced
by another group (See, for example, Lomax, M.I., Prim. Sens. Neuron (1998), which
is hereby incorporated by reference herein).



In specific embodiments, polypeptides of the invention comprise the following 
amino acid sequence: IRRLDOSIFDIKVLNAQRAGYKAArVHNVDSDDLISMGSN 
Dffi\a.KKroiPSVFIGESSANSLKDEFTYEKGGHLILVPEFSLPLEYYLIPFLnVGI 
CLILrVIFMTTKFVQDRHRARRNRLRKDQLKKLPVHKFKKG 
EDGDKLRILPCSHAYHCKCVDPWLTKTKKTCPVCKQKVVPSQGDSDSDTDSS 
QEENEVTEHTPIXRPLASVSAQSFGAL^ESRSHQNMTESSDYEEDDNEDTDSSD 
AE (SEQ ID NO: 122); NFDIKVLNAQRAGYKAAIVHNVDSDD (SEQ ID NO: 115);
VLKKIDIPSVFIGESSANSLKDEFTYEK (SEQ ID NO: 116); PEFSLPLEYYLIPFL
nVGICULIVIFMI (SEQ ID NO: 117); TKFVQDRHRARRNRLRKDQLKKLPVHK
FKKGDEY (SEQ ID NO: 118); EDGDKLRILPCSHAYHCKCVDPWLTKT (SEQ ID
NO: 119); WPSQGDSDSDTDSSQEENEVTEH (SEQ ID NO: 120); and/or QSFGALS
ESRSHQNMTESSDYEEDDNEDT (SEQ ID NO: 121). The gene encoding the
disclosed cDNA is believed to reside on chromosome 3. Accordingly, polynucleotides 
related to this invention are useful as a marker in linkage analysis for chromosome 3. 
This gene is expressed in many adult and fetal tissues.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, many diseases such as developmental, immune, and neural diseases 
and/or disorders. Similarly, polypeptides and antibodies directed to these polypeptides
are useful in providing immunological probes for differential identification of the
tissue(s) or cell type(s). For a number of disorders of the above tissues or cells,
particularly of the hematopoietic system, central nervous system, immune system and
others, expression of this gene at significantly higher or lower levels may be routinely
detected in certain tissues or cell types (e.g., developmental, immune, neural, and
cancerous and wounded tissues) or bodily fluids (e.g., lymph, serum, plasma, urine,
synovial fluid or spinal fluid) or another tissue or cell sample taken from an individual 
having such a disorder, relative to the standard gene expression level, i.e., the 
expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. 

Preferred epitopes include those comprising a sequence shown in SEQ ID NO:
64 as residues: Asn-43 to Asp-49, Ser-71 to Ala-76, Pro-84 to Gly-91.
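
As an illustration of how residue ranges of this form can be read programmatically, the sketch below parses strings such as "Asn-43 to Asp-49" into start and end positions and reports the length of each epitope. It is a hedged example only: the regular expression and the function names are assumptions made for illustration, and no sequence data from the sequence listing is used.

```python
import re

# Matches ranges written as "<Res>-<pos> to <Res>-<pos>", where <Res> is a
# three-letter amino acid code (hypothetical pattern, for illustration).
RANGE_RE = re.compile(r"([A-Z][a-z]{2})-(\d+) to ([A-Z][a-z]{2})-(\d+)")


def parse_epitope_ranges(text: str) -> list[tuple[str, int, str, int]]:
    """Return (start_residue, start_pos, end_residue, end_pos) for each range."""
    return [
        (start_res, int(start_pos), end_res, int(end_pos))
        for start_res, start_pos, end_res, end_pos in RANGE_RE.findall(text)
    ]


def epitope_lengths(text: str) -> list[int]:
    """Return the number of residues in each range, counting both endpoints."""
    return [end - start + 1 for _, start, _, end in parse_epitope_ranges(text)]


if __name__ == "__main__":
    ranges_text = "Asn-43 to Asp-49, Ser-71 to Ala-76, Pro-84 to Gly-91"
    for rng, length in zip(parse_epitope_ranges(ranges_text),
                           epitope_lengths(ranges_text)):
        print(rng, length)
```

Positions are treated as 1-based and inclusive, so a range from residue 43 to residue 49 covers seven residues.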

The tissue distribution in fetal tissues, combined with the homology to ring zinc 
proteins, indicates that the protein products of this gene are useful for treating and/or 
diagnosing diseases in the immune system, hematopoietic system and developmental
disorders. The secreted protein can also be used to determine biological activity, to raise
antibodies, as tissue markers, to isolate cognate ligands or receptors, to identify agents 
that modulate their interactions, and as nutritional supplements. 

The protein product of this clone may also have a very wide range of biological
activities. Typical of these are cytokine, cell proliferation/differentiation modulating
activity or induction of other cytokines; immunostimulating/immunosuppressant
activities (e.g. for treating human immunodeficiency virus infection, cancer,
autoimmune diseases and allergy); regulation of hematopoiesis (e.g. for treating anemia
or as adjunct to chemotherapy); stimulation or growth of bone, cartilage, tendons,
ligaments and/or nerves (e.g. for treating wounds, stimulation of follicle stimulating
hormone (for control of fertility); chemotactic and chemokinetic activities (e.g. for
treating infections, tumors); hemostatic or thrombolytic activity (e.g. for treating
hemophilia, cardiac infarction etc.); anti-inflammatory activity (e.g. for treating septic
shock, Crohn's disease); as antimicrobials; for treating psoriasis or other
hyperproliferative diseases; for regulation of metabolism, and behavior. Also
contemplated is the use of the corresponding nucleic acid in gene therapy procedures.
Moreover, the expression within fetal tissue indicates this protein may play a
role in the regulation of cellular division, and may show utility in the diagnosis and
treatment of cancer and other proliferative disorders. Similarly, developmental tissues
rely on decisions involving cell differentiation and/or apoptosis in pattern formation.
Dysregulation of apoptosis can result in inappropriate suppression of cell death, as
occurs in the development of some cancers, or in failure to control the extent of cell
death, as is believed to occur in acquired immunodeficiency and certain
neurodegenerative disorders, such as spinal muscular atrophy (SMA). Therefore, the
polynucleotides and polypeptides of the present invention are useful in treating,
detecting, and/or preventing said disorders and conditions, in addition to other types of
degenerative conditions. Thus this protein may modulate apoptosis or tissue
differentiation and is useful in the detection, treatment, and/or prevention of
degenerative or proliferative conditions and diseases. Protein, as well as antibodies
directed against the protein, may show utility as a tumor marker and/or immunotherapy
targets for the above listed tissues.

Many polynucleotide sequences, such as EST sequences, are publicly available 
and accessible through sequence databases. Some of these sequences are related to SEQ
ID NO:31 and may have been publicly available prior to conception of the present
invention. Preferably, such related polynucleotides are specifically excluded from the
scope of the present invention. To list every related sequence is cumbersome.
Accordingly, preferably excluded from the present invention are one or more
polynucleotides comprising a nucleotide sequence described by the general formula of
a-b, where a is any integer between 1 to 2363 of SEQ ID NO:31, b is an integer of 15
to 2377, where both a and b correspond to the positions of nucleotide residues shown
in SEQ ID NO:31, and where b is greater than or equal to a + 14.

FEATURES OF PROTEIN ENCODED BY GENE NO: 22

The translation product of this gene shares sequence homology with kidney
transporter, which is thought to be important in kidney function and dialysis (See
Genebank Accession No: gi|3831566 (AF057039)). This protein was subsequently
cloned and sequenced by another group (See, for example, Reid,G., Kidney Blood
Press. Res. 21 (2-4), 233-237 (1998), which is hereby incorporated by reference,
herein).

In specific embodiments, polypeptides of the invention comprise the following
amino acid sequence: AQCSIYLIQVIFGAVDLPAKLVGFLVINSLGRRPAQ (SEQ ID 
NO: 123); GTVQHLPNPGDLWCCGPACQACGLPCHQLPGSPACPDGCTAAGRHL 
HPAQWGDTPGPVHCPNLSCCAGEGLSGCLLQLHLPVYWELYPTMIRQTGMGM 
GSTMARVGSIVSPLVSMTAELYPSMPLFIYGAVPVAASAVTVLLPETLGQPLPDT 
VQDLESRKGKQTRQQQEHQKYMVPLQASAQEKNGL (SEQ ID NO: 124); LPNPG 
DLWCCGPACQACGLPCHQ (SEQ ID NO: 125); GCTAAGRHLHPAQWGDTPGPV
HCPNL (SEQ ID NO: 126); LHLPVYWELYPTMIRQTGMGMG (SEQ ID NO: 127);
LVSMTAELYPSMPLFIYGAVPVA (SEQ ID NO: 128); and/or PDTVQDLESRKGKQ
TRQQQEHQKYMVP (SEQ ID NO: 129). Polynucleotides encoding these polypeptides
are also encompassed by the invention. The gene encoding the disclosed cDNA is
believed to reside on chromosome 1. Accordingly, polynucleotides related to this
invention are useful as a marker in linkage analysis for chromosome 1.

This gene is expressed primarily in fetal brain, fetal kidney and adult kidney
tissues.

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, developmental diseases and/or disorders, particularly kidney and neural 
disorders. Similarly, polypeptides and antibodies directed to these polypeptides are 
useful in providing immunological probes for differential identification of the tissue(s) 
or cell type(s). For a number of disorders of the above tissues or cells, particularly of 
the renal and urologic system, expression of this gene at significantly higher or lower 
levels may be routinely detected in certain tissues or cell types (e.g., developmental, 
neural, renal, urogenital, and cancerous and wounded tissues) or bodily fluids (e.g., 
lymph, serum, plasma, urine, synovial fluid and spinal fluid) or another tissue or cell 
sample taken from an individual having such a disorder, relative to the standard gene 
expression level, i.e., the expression level in healthy tissue or bodily fluid from an 
individual not having the disorder. 

The tissue distribution in fetal and adult kidney tissues, combined with the 
homology to kidney specific transporter, indicates that the protein products of this gene 
are useful for the treatment and/or diagnosis of renal and urologic disorders, as well as 
developmental disorders of the central nervous system. Moreover, the protein product 
of this gene could be used in the treatment and/or detection of kidney diseases including 
renal failure, nephritis, renal tubular acidosis, proteinuria, pyuria, edema,
pyelonephritis, hydronephritis, nephrotic syndrome, crush syndrome,
glomerulonephritis, hematuria, renal colic and kidney stones, in addition to Wilms'
Tumor Disease, and congenital kidney abnormalities such as horseshoe kidney,
polycystic kidney, and Fanconi's syndrome.

Alternatively, polynucleotides and polypeptides corresponding to this gene are
useful for the detection, treatment, and/or prevention of neurodegenerative disease
states, behavioral disorders, or inflammatory conditions which include, but are not
limited to Alzheimer's Disease, Parkinson's Disease, Huntington's Disease, Tourette
Syndrome, meningitis, encephalitis, demyelinating diseases, peripheral neuropathies,
neoplasia, trauma, congenital malformations, spinal cord injuries, ischemia and
infarction, aneurysms, hemorrhages, schizophrenia, mania, dementia, paranoia,
obsessive compulsive disorder, depression, panic disorder, learning disabilities, ALS,
psychoses, autism, and altered behaviors, including disorders in feeding, sleep
patterns, balance, and perception. In addition, elevated expression of this gene product
in regions of the brain indicates it plays a role in normal neural function. Potentially,
this gene product is involved in synapse formation, neurotransmission, learning,
cognition, homeostasis, or neuronal differentiation or survival. Protein, as well as
antibodies directed against the protein, may show utility as a tumor marker and/or
immunotherapy targets for the above listed tissues.

Many polynucleotide sequences, such as EST sequences, are publicly available 
and accessible through sequence databases. Some of these sequences are related to SEQ 
ID NO:32 and may have been publicly available prior to conception of the present
invention. Preferably, such related polynucleotides are specifically excluded from the
scope of the present invention. To list every related sequence is cumbersome.
Accordingly, preferably excluded from the present invention are one or more
polynucleotides comprising a nucleotide sequence described by the general formula of
a-b, where a is any integer between 1 to 781 of SEQ ID NO:32, b is an integer of 15 to
795, where both a and b correspond to the positions of nucleotide residues shown in 
SEQ ID NO:32, and where b is greater than or equal to a + 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 23

The translation product of this gene shares sequence homology with the
ubiquitin-specific protease, UBP2, (See Geneseq Accession No.R36730), which is
thought to be important in metabolic processes, tissue repair, and wound healing.

In specific embodiments, polypeptides of the invention comprise the following 
amino acid sequence: CLEAAMffiGEffiSLHSENSGKSGQEHWFTELPPVLTFELS 
RFEFNQALGRPEKIHNia^FPQ\T.YIJDRYMHRNRErrRIKREEIKRLKDYLTVL
QQRLERYLSYGSGPKRFPLVDVLQYALEFASSKPVCTSPVDDIDASSPPSGSIPS 
QTLPSTTEQQGALSSELPSTSPSSVAAISSRSVIHKPFTQSRIPPDLPMHPAPRHrr
EEELSVLESCLHRWRTEIENDTRDLQESISRIHRTIELMYSDKSMIQVPYRLHAVL
VHEGQANAGHYWAYIFDHRESRWMKYNDIAVTKSSWEELVRDSFGGYRNAS 
AYCLMYINDKAQFLIQEEFNKETGQPLVGIETLPPDLRDFVEEDNQRFEKELEE 
WDAQLAQKALQEKLLASQKLRESETSVTTAQAAGDPEYLEQPSRSDFSKHLKE 
ETIQIITKASHEHEDKSPETVLQSAIKLEYARLVKLAQEDTPPETDYRLHHWVY
HQNQAPKKIIEKTLLEQFGDRNLSFDERCHNIMKVAQAKLEMIKPEEVNLEEYE 
EWHQDYRKFRETTMYLnGLENFQRESYIDSLLFLICAYQNNKELLSKGLYRGH 
DEELISHYRRECLLKLNEQAAELFESGEDREVNNGLIIMNEFIVPFLPLLLVDEM 
EEKDILAVEDMRNRWCSYLGQEMEPHLQEKLTDFLPKLLDCSMEIKSFHEPPK 
LPSYSTHELCERFARIMLSLSRTPADGR (SEQ ID NO: 130); MIEGEffiSLHSENS
GKSGQEHWFT (SEQ ID NO: 131); FELSRFEFNQALGRPEKIHNKLEFP (SEQ ID
NO: 132); ITRIKREEIKRLKDYLTVLQQRLER (SEQ ID NO: 133); PKRFPLVDVL 
QYALEFASSKPVCTSPV (SEQ ID NO: 134); IPSQTLPSTTEQQGALSSELPSTSPS 
(SEQ ID NO: 135); SVIHKPFTQSRIPPDLPMHPAPRH (SEQ ID NO: 136); CLHRW 
RTEIENDTRDLQESISRI (SEQ ID NO:137); KSMIQVPYRLHAVLVHEGQANAG
HYWAY (SEQ ID NO: 138); RWMKYNDIAVTKSSWEELVRDSFGGYRNA (SEQ 
ID NO: 139); INDKAQFLIQEEFNKETGQPLVGI (SEQ ID NO: 140); MIQVPYRLHA 
VLVHEGQANAGHY (SEQ ID NO: 141); DNQRFEKELEEWDAQLAQKALQEKLL 
(SEQ ID NO: 142); SETSVTTAQAAGDPEYLEQPSRS (SEQ ID NO: 143); QITTKA 
SHEHEDKSPETVLQSAIKLEYA (SEQ ID NO: 144); LAQEDTPPETDYRLHHWV
YFIQNQAPK (SEQ ID NO: 145); GDRNLSFDERCHNIMKVAQAKLEMIKPEE 
(SEQ ID NO: 146); EEWHQDYRKFRETTMYLnGLENFQR (SEQ ID NO: 147); CAY 
QNNKELLSKGLYRGHDEELISHYRR (SEQ ID NO: 148); CLLKLNEQAAELFESG 
DREVNNGLDM (SEQ ID NO: 149); VDEMEEKDILAVEDMRNRWCSYLX3QEMEP 
HL (SEQ ID NO: 150); and/or QEKLTDFLPKLLDCSMEIKSFHEPP (SEQ ID
NO: 151). Polynucleotides encoding these polypeptides are also encompassed by the
invention. The gene encoding the disclosed cDNA is believed to reside on chromosome
21. Accordingly, polynucleotides related to this invention are useful as a marker in
linkage analysis for chromosome 21.

This gene is expressed primarily in fetal tissues and tumors thereof.

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, developmental diseases and/or disorders, particularly cancers. Similarly,
polypeptides and antibodies directed to these polypeptides are useful in providing
immunological probes for differential identification of the tissue(s) or cell type(s). For a
number of disorders of the above tissues or cells, particularly of the immune system,
expression of this gene at significantly higher or lower levels may be routinely detected
in certain tissues or cell types (e.g., developmental, and cancerous and wounded 
tissues) or bodily fluids (e.g., lymph, serum, plasma, urine, amniotic fluid, synovial 
fluid and spinal fluid) or another tissue or cell sample taken from an individual having 
such a disorder, relative to the standard gene expression level, i.e., the expression level 
in healthy tissue or bodily fluid from an individual not having the disorder. 

Preferred epitopes include those comprising a sequence shown in SEQ ID NO: 
66 as residues: Tyr-29 to Gln-46. 

The tissue distribution in fetal tissues and tumors thereof, combined with the
homology to a human ubiquitin-specific protease, indicates that polynucleotides and 
polypeptides corresponding to this gene are useful for the treatment and/or diagnosis of 
cancers and developmental disorders. Moreover, polynucleotides and polypeptides 
corresponding to this gene are useful for the diagnosis, detection, and/or treatment of 
developmental disorders, and may be a key player in the proliferation, maintenance, 
and/or differentiation of various cell types during development. It may also act as a
morphogen to control cell and tissue type specification. Because of potential roles in 
proliferation and differentiation, this gene product may have applications in the adult for 
tissue regeneration and the treatment of cancers. 

Expression within fetal tissue and other cellular sources marked by proliferating 
cells indicates that this protein may play a role in the regulation of cellular division, and 
may show utility in the diagnosis and/or treatment of cancer and other proliferative
disorders. Similarly, developmental tissues rely on decisions involving cell
differentiation and/or apoptosis in pattern formation. Dysregulation of apoptosis can
result in inappropriate suppression of cell death, as occurs in the development of some
cancers, or in failure to control the extent of cell death, as is believed to occur in
acquired immunodeficiency and certain neurodegenerative disorders, such as spinal
muscular atrophy (SMA). Therefore, the polynucleotides and polypeptides of the
present invention are useful in treating, detecting, and/or preventing said disorders and
conditions, in addition to other types of degenerative conditions. Thus this protein may
modulate apoptosis or tissue differentiation and is useful in the detection, treatment,
and/or prevention of degenerative or proliferative conditions and diseases. Protein, as
well as antibodies directed against the protein, may show utility as a tumor marker
and/or immunotherapy targets for the above listed tissues.

Many polynucleotide sequences, such as EST sequences, are publicly available 
and accessible through sequence databases. Some of these sequences are related to SEQ
ID NO:33 and may have been publicly available prior to conception of the present
invention. Preferably, such related polynucleotides are specifically excluded from the
scope of the present invention. To list every related sequence is cumbersome.
Accordingly, preferably excluded from the present invention are one or more
polynucleotides comprising a nucleotide sequence described by the general formula of
a-b, where a is any integer between 1 to 2642 of SEQ ID NO: 33, b is an integer of 15
to 2656, where both a and b correspond to the positions of nucleotide residues shown
in SEQ ID NO:33, and where b is greater than or equal to a + 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 24 

In specific embodiments, polypeptides of the invention comprise the following
amino acid sequence: QIATSVHHNINRKKRSVLRLL (SEQ ID NO: 152).
Polynucleotides encoding these polypeptides are also encompassed by the invention.

This gene is expressed primarily in fetal heart tissue.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are
not limited to, developmental and cardiovascular diseases and/or disorders, particularly
heart diseases. Similarly, polypeptides and antibodies directed to these polypeptides are
useful in providing immunological probes for differential identification of the tissue(s)
or cell type(s). For a number of disorders of the above tissues or cells, particularly of
the heart, expression of this gene at significantly higher or lower levels may be
routinely detected in certain tissues or cell types (e.g., developmental, cardiovascular,
and cancerous and wounded tissues) or bodily fluids (e.g., lymph, serum, amniotic
fluid, plasma, urine, synovial fluid and spinal fluid) or another tissue or cell sample
taken from an individual having such a disorder, relative to the standard gene
expression level, i.e., the expression level in healthy tissue or bodily fluid from an
individual not having the disorder. 

Preferred epitopes include those comprising a sequence shown in SEQ ID NO: 
67 as residues: Ser-19 to Ser-25, Pro-27 to Gly-33, Pro-40 to Asn-47, Pro-65 to
Gln-70.

The tissue distribution in fetal heart tissue indicates that polynucleotides and
polypeptides corresponding to this gene are useful for diagnosing and/or treating heart
diseases. The protein is useful in treating and/or detecting, but not limited to, the 
following: congenital birth defects, myocardial infarction, atherosclerosis, 
arteriosclerosis, endocarditis, cardiomyopathies, and myocarditis. 

Moreover, the expression within fetal tissue indicates this protein may play a
role in the regulation of cellular division, and may show utility in the diagnosis and
treatment of cancer and other proliferative disorders. Similarly, developmental tissues
rely on decisions involving cell differentiation and/or apoptosis in pattern formation.
Dysregulation of apoptosis can result in inappropriate suppression of cell death, as
occurs in the development of some cancers, or in failure to control the extent of cell
death, as is believed to occur in acquired immunodeficiency and certain
neurodegenerative disorders, such as spinal muscular atrophy (SMA). Therefore, the
polynucleotides and polypeptides of the present invention are useful in treating,
detecting, and/or preventing said disorders and conditions, in addition to other types of
degenerative conditions. Thus this protein may modulate apoptosis or tissue
differentiation and is useful in the detection, treatment, and/or prevention of
degenerative or proliferative conditions and diseases. Protein, as well as antibodies
directed against the protein, may show utility as a tumor marker and/or immunotherapy
targets for the above listed tissues.

Many polynucleotide sequences, such as EST sequences, are publicly available 
and accessible through sequence databases. Some of these sequences are related to SEQ
ID NO:34 and may have been publicly available prior to conception of the present
invention. Preferably, such related polynucleotides are specifically excluded from the
scope of the present invention. To list every related sequence is cumbersome.
Accordingly, preferably excluded from the present invention are one or more
polynucleotides comprising a nucleotide sequence described by the general formula of
a-b, where a is any integer between 1 to 2552 of SEQ ID NO: 34, b is an integer of 15
to 2566, where both a and b correspond to the positions of nucleotide residues shown
in SEQ ID NO:34, and where b is greater than or equal to a + 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 25

In specific embodiments, polypeptides of the invention comprise the following
amino acid sequence: PLLRGLFIRXRAGHYECVFHEXVEGGACCEQC (SEQ ID
NO: 153); LVNNSFFLEnYRPDSKNWQYQETIKKGDLLLNRVQKLSRVINM
(SEQ ID NO: 154); and/or IRELSRFIAAGRLHCKIDKVNEIVETNRYSHFSE (SEQ
ID NO: 155). Polynucleotides encoding these polypeptides are also encompassed by the
invention.

This gene is expressed primarily in activated T-cells. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, immune and hematopoietic disorders and/or diseases. Similarly,
polypeptides and antibodies directed to these polypeptides are useful in providing
immunological probes for differential identification of the tissue(s) or cell type(s). For a
number of disorders of the above tissues or cells, particularly of the immune system,
expression of this gene at significantly higher or lower levels may be routinely detected 
in certain tissues or cell types (e.g., immune, hematopoietic, and cancerous and 
wounded tissues) or bodily fluids (e.g., lymph, serum, plasma, urine, synovial fluid 
and spinal fluid) taken from an individual having such a disorder, relative to the 
standard gene expression level, i.e., the expression level in healthy tissue or bodily 
fluid from an individual not having the disorder. 

Preferred epitopes include those comprising a sequence shown in SEQ ID NO: 
68 as residues: Gln-23 to Asn-28, Gly-38 to Ile-43.

The tissue distribution in activated T-cells indicates polynucleotides and 
polypeptides corresponding to this gene are useful for the diagnosis and/or treatment of 
a variety of immune system disorders. Moreover, the expression of this gene product
indicates a role in regulating the proliferation; survival; differentiation; and/or activation
of hematopoietic cell lineages, including blood stem cells. This gene product may be 
involved in the regulation of cytokine production, antigen presentation, or other 
processes suggesting a usefulness in the treatment of cancer (e.g. by boosting immune 
responses). 

Since the gene is expressed in cells of lymphoid origin, the natural gene product 
may be involved in immune functions. Therefore it may be also used as an agent for 
immunological disorders including arthritis, asthma, immunodeficiency diseases such 
as AIDS, leukemia, rheumatoid arthritis, granulomatous disease, inflammatory bowel
disease, sepsis, acne, neutropenia, neutrophilia, psoriasis, hypersensitivities, such as
T-cell mediated cytotoxicity; immune reactions to transplanted organs and tissues, such
as host-versus-graft and graft-versus-host diseases, or autoimmunity disorders, such as
autoimmune infertility, lens tissue injury, demyelination, systemic lupus
erythematosus, drug induced hemolytic anemia, rheumatoid arthritis, Sjogren's disease,
scleroderma and tissues. Moreover, the protein may represent a secreted factor that
influences the differentiation or behavior of other blood cells, or that recruits
hematopoietic cells to sites of injury. In addition, this gene product may have
commercial utility in the expansion of stem cells and committed progenitors of various
blood lineages, and in the differentiation and/or proliferation of various cell types.
Protein, as well as antibodies directed against the protein, may show utility as a tumor
marker and/or immunotherapy targets for the above listed tissues.

Many polynucleotide sequences, such as EST sequences, are publicly available 
and accessible through sequence databases. Some of these sequences are related to SEQ 
ID NO: 35 and may have been publicly available prior to conception of the present 
invention. Preferably, such related polynucleotides are specifically excluded from the
scope of the present invention. To list every related sequence is cumbersome.
Accordingly, preferably excluded from the present invention are one or more 
polynucleotides comprising a nucleotide sequence described by the general formula of 
a-b, where a is any integer between 1 to 1654 of SEQ ID NO:35, b is an integer of 15 
to 1668, where both a and b correspond to the positions of nucleotide residues shown 
in SEQ ID NO:35, and where b is greater than or equal to a + 14. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 26 

The translation product of this gene shares sequence homology with 
glutathione-S-transferase, which is thought to be important in inflammatory responses. 

In specific embodiments, polypeptides of the invention comprise the following 
amino acid sequences: GSQPPGPVPEXLIRIYSMRFCPYSHRTRLVLKAKDIRHE 
VVNINLRNKPEWYYTKHPFGHIPVLETSQCQLIYESVIACEYLDDAYPGRK1.FP 
YDPYERARQKMLLELFCKVPHLTKECLVALRCGRECTNLKAALRQ^^^ 
EYQNTTFFGGTCISMIDYLLWPWFERLDVYGILDCVSHTPACGSGYQP (SEQ ID 
NO:156); LASPFPVPLHRCSA (SEQ ID NO:157); MRFCPYSHRTRLVLKAKDIRH 
EWNINLR (SEQ ID NO: 158); NKPEWYYTKHPFGHIPVLETSQCQ (SEQ ID 
NO: 159); KLFPYDPYERARQKMLLELFCKVP (SEQ ID NO: 160); VALRCGRECT 
NLKAALRQEFSNLEE (SEQ ID NO: 161); AAGCVWDTGLCEPHXSLRLWISAM 
KWDPTVCALLMDKSIFQGFLNLYFQNNPNAFDFGLC (SEQ ID NO: 163); and/or 
SMIDYLLWPWFERLDVYGILDCVS (SEQ ID NO: 162). Polynucleotides encoding 
these polypeptides are also encompassed by the invention. 

This gene is expressed primarily in keratinocytes, melanocytes, and fetal skin
tissues.

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, integumentary, inflammatory, and/or developmental diseases and/or 
disorders. Similarly, polypeptides and antibodies directed to these polypeptides are 
useful in providing immunological probes for differential identification of the tissue(s)
or cell type(s). For a number of disorders of the above tissues or cells, particularly of 
the skin, expression of this gene at significantly higher or lower levels may be routinely 
detected in certain tissues or cell types (e.g., integumentary, inflammatory, 
developmental, metabolic, and cancerous and wounded tissues) or bodily fluids (e.g., 
lymph, serum, plasma, urine, synovial fluid and spinal fluid) or another tissue or cell 
sample taken from an individual having such a disorder, relative to the standard gene
expression level, i.e., the expression level in healthy tissue from an individual not
having the disorder. 

The tissue distribution in integumentary cells and tissues, combined with the
homology to glutathione-S-transferase, indicates that the protein products of this gene
are useful for the diagnosis and treatment of inflammatory and skin diseases. Moreover,
polynucleotides and polypeptides corresponding to this gene are useful for the
treatment, diagnosis, and/or prevention of various skin disorders including congenital
disorders (i.e. nevi, moles, freckles, Mongolian spots, hemangiomas, port-wine
syndrome), integumentary tumors (i.e. keratoses, Bowen's disease, basal cell
carcinoma, squamous cell carcinoma, malignant melanoma, Paget's disease, mycosis
fungoides, and Kaposi's sarcoma), injuries and inflammation of the skin (i.e. wounds,
rashes, prickly heat disorder, psoriasis, dermatitis), atherosclerosis, urticaria, eczema,
photosensitivity, autoimmune disorders (i.e. lupus erythematosus, vitiligo,
dermatomyositis, morphea, scleroderma, pemphigoid, and pemphigus), keloids, striae,
erythema, petechiae, purpura, and xanthelasma. In addition, such disorders may
predispose increased susceptibility to viral and bacterial infections of the skin (i.e. cold
sores, warts, chickenpox, molluscum contagiosum, herpes zoster, boils, cellulitis,
erysipelas, impetigo, tinea, athlete's foot, and ringworm).

Moreover, the protein product of this gene may also be useful for the treatment
or diagnosis of various connective tissue disorders such as arthritis, trauma, tendonitis,
chondromalacia and inflammation, autoimmune disorders such as rheumatoid arthritis,
lupus, scleroderma, and dermatomyositis as well as dwarfism, spinal deformation, and
specific joint abnormalities as well as chondrodysplasias (i.e. spondyloepiphyseal
dysplasia congenita, familial osteoarthritis, Atelosteogenesis type II, metaphyseal
chondrodysplasia type Schmid). Protein, as well as antibodies directed against the
protein, may show utility as a tumor marker and/or immunotherapy targets for the above
listed tissues.

Many polynucleotide sequences, such as EST sequences, are publicly available
and accessible through sequence databases. Some of these sequences are related to SEQ
ID NO:36 and may have been publicly available prior to conception of the present
invention. Preferably, such related polynucleotides are specifically excluded from the
scope of the present invention. To list every related sequence is cumbersome.
Accordingly, preferably excluded from the present invention are one or more
polynucleotides comprising a nucleotide sequence described by the general formula of
a-b, where a is any integer between 1 to 969 of SEQ ID NO:36, b is an integer of 15 to
983, where both a and b correspond to the positions of nucleotide residues shown in
SEQ ID NO:36, and where b is greater than or equal to a + 14.
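
For illustration only, the general formula a-b can be read as a set of position pairs. The Python sketch below is not part of the original disclosure. It assumes 1-based, inclusive nucleotide positions and treats the stated limits as inclusive; the helper names is_excluded_range and count_ranges are hypothetical, and the default limits 969 and 983 are the values given above for SEQ ID NO:36.

```python
def is_excluded_range(a: int, b: int, a_max: int = 969, b_max: int = 983) -> bool:
    """Return True if (a, b) satisfies the constraints stated for formula a-b.

    Assumption: positions are 1-based and all limits are inclusive.
    """
    return 1 <= a <= a_max and 15 <= b <= b_max and b >= a + 14


def count_ranges(a_max: int = 969, b_max: int = 983) -> int:
    """Count every (a, b) pair permitted by the formula."""
    total = 0
    for a in range(1, a_max + 1):
        lowest_b = max(15, a + 14)  # b must be at least 15 and at least a + 14
        total += max(0, b_max - lowest_b + 1)
    return total
```

Under these assumptions the shortest fragment described by the formula is 15 nucleotides long (b = a + 14), and the same two helpers apply to the other SEQ ID NOs by substituting their stated limits.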

FEATURES OF PROTEIN ENCODED BY GENE NO: 27

In specific embodiments, polypeptides of the invention comprise the following
amino acid sequences: VYLFLTYRQAWIALLVKVGVISEKHTWEWQTVEAVATG
LQDPnCffiMFLAAIAHHYTFSYKPYVQEAEEGSCFDSFLAMWDVSDIRD^^
QVRHVGRTVRGHPRKKLFPEDQDQNEHTSLLSSSSQDAISIASSMPPSPMGHY
QGFGHTVTPQTTPTTAK ISDEILSDTIGEKKEPS (SEQ ID NO: 164); TNNKDSLG
WYLFr\a.DSWL\LKYPGIAIYVDTCRECYEAYVrmFMGFLT^
ILEAKDQQKHFPPLCCCPPWAMGEVLLFRCKLSVLQYTWRPFTTIVALICELX
r«3EGNFSFSNAWTYLVnNNMSQLFAMYCLLLFYKVLKEEl^PIQPVGK^
KLVVF (SEQ ID NO: 165); and/or QNSQRTGLPITIFSRSFPLLTGSDLCEN (SEQ ID
NO: 166). Polynucleotides encoding these polypeptides are also encompassed by the
invention. The gene encoding the disclosed cDNA is believed to reside on chromosome
4. Accordingly, polynucleotides related to this invention are useful as a marker in
linkage analysis for chromosome 4.

This gene is expressed primarily in retinal tissue, and to a lesser extent in 
keratinocytes, T-helper cells, endometrial tumor cells and infant brain tissue. 

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions which include, but are
not limited to, visual and immune diseases and/or disorders. Similarly, polypeptides
and antibodies directed to these polypeptides are useful in providing immunological
probes for differential identification of the tissue(s) or cell type(s). For a number of
disorders of the above tissues or cells, expression of this gene at significantly higher or
lower levels may be routinely detected in certain tissues or cell types (e.g., visual,
immune, and cancerous and wounded tissues) or bodily fluids (e.g., lymph, serum,
plasma, urine, synovial fluid and spinal fluid) or another tissue or cell sample taken
from an individual having such a disorder, relative to the standard gene expression
level, i.e., the expression level in healthy tissue from an individual not having the
disorder.

Preferred epitopes include those comprising a sequence shown in SEQ ID NO: 
70 as residues: Thr-6 to Trp-13. 

The tissue distribution in retinal tissue indicates that polynucleotides and
polypeptides corresponding to this gene are useful in the treatment and/or diagnosis of
visual disorders, which include, but are not limited to, glaucoma, retinal/macular
degeneration, cataracts, conjunctivitis, and/or autoimmune disorders. Moreover, the
expression of this gene product in immune tissues indicates a role in regulating the
proliferation; survival; differentiation; and/or activation of hematopoietic cell lineages,
including blood stem cells. This gene product may be involved in the regulation of
cytokine production, antigen presentation, or other processes suggesting a usefulness in
the treatment of cancer (e.g. by boosting immune responses).

Since the gene is expressed in cells of lymphoid origin, the natural gene product
may be involved in immune functions. Therefore it may be also used as an agent for
immunological disorders including arthritis, asthma, immunodeficiency diseases such
as AIDS, leukemia, rheumatoid arthritis, granulomatous disease, inflammatory bowel
disease, sepsis, acne, neutropenia, neutrophilia, psoriasis, hypersensitivities, such as
T-cell mediated cytotoxicity; immune reactions to transplanted organs and tissues, such
as host-versus-graft and graft-versus-host diseases, or autoimmunity disorders, such as
autoimmune infertility, lens tissue injury, demyelination, systemic lupus
erythematosus, drug induced hemolytic anemia, rheumatoid arthritis, Sjogren's disease,
scleroderma and tissues. Moreover, the protein may represent a secreted factor that
influences the differentiation or behavior of other blood cells, or that recruits
hematopoietic cells to sites of injury. In addition, this gene product may have
commercial utility in the expansion of stem cells and committed progenitors of various
blood lineages, and in the differentiation and/or proliferation of various cell types.
Protein, as well as antibodies directed against the protein, may show utility as a tumor
marker and/or immunotherapy targets for the above listed tissues.

Many polynucleotide sequences, such as EST sequences, are publicly available
and accessible through sequence databases. Some of these sequences are related to SEQ
ID NO:37 and may have been publicly available prior to conception of the present
invention. Preferably, such related polynucleotides are specifically excluded from the
scope of the present invention. To list every related sequence is cumbersome.
Accordingly, preferably excluded from the present invention are one or more
polynucleotides comprising a nucleotide sequence described by the general formula of
a-b, where a is any integer between 1 to 2337 of SEQ ID NO:37, b is an integer of 15
to 2351, where both a and b correspond to the positions of nucleotide residues shown
in SEQ ID NO:37, and where b is greater than or equal to a + 14.

FEATURES OF PROTEIN ENCODED BY GENE NO: 28 

When tested against Jurkat and U937 cell lines, supernatants removed from
cells containing this gene activated the GAS (gamma activating sequence) promoter
element. Thus, it is likely that this gene activates promyelocytic cells and T-cells, and to a
lesser extent, immune cells and tissues, through the Jak-STAT signal transduction
pathway. GAS is a promoter element found upstream of many genes which are
involved in the Jak-STAT pathway. The Jak-STAT pathway is a large signal
transduction pathway involved in the differentiation and proliferation of cells.
Therefore, activation of the Jak-STAT pathway, reflected by the binding of the GAS
element, can be used to indicate proteins involved in the proliferation and differentiation
of cells.

In specific embodiments, polypeptides of the invention comprise the following
amino acid sequence: QFFLCRDCS (SEQ ID NO: 167); ERESCSIIQAGVQWCNLSSL
RPPPPGFKQFSHLSLPSS (SEQ ID NO: 168); LRENLALSSRLECSGAISAHCD
LHLLGSSNSPTSASQVVRTTGAHHQAQPIFVFLVETGFHHVGQAHLKQLTSRY
PPHLASQSAGITGMSYRTQPKLLWFYLYKQFKQYREVGSRK (SEQ ID NO: 169);
SSRLECSGAISAHCDLHLLGSSNSP (SEQ ID NO: 170); GAHHQAQPIFVFLVET
GFHHVGQAHLKQLTSRYPPHLASQ (SEQ ID NO: 171); and/or ITGMSYRTQPKL
LWFYLYKQFKQYR (SEQ ID NO: 172). Polynucleotides encoding these polypeptides
are also encompassed by the invention.

This gene is expressed primarily in kidney tissue. 

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions which include, but are
not limited to, renal and/or urogenital diseases and/or conditions. Similarly,
polypeptides and antibodies directed to these polypeptides are useful in providing
immunological probes for differential identification of the tissue(s) or cell type(s). For a
number of disorders of the above tissues or cells, particularly of the kidney, expression
of this gene at significantly higher or lower levels may be routinely detected in certain
tissues or cell types (e.g., renal, urogenital, and cancerous and wounded tissues) or
bodily fluids (e.g., lymph, serum, plasma, urine, synovial fluid and spinal fluid) or
another tissue or cell sample taken from an individual having such a disorder, relative to
the standard gene expression level, i.e., the expression level in healthy tissue or bodily
fluid from an individual not having the disorder.

The tissue distribution in kidney tissue, combined with the detected GAS
biological activity, indicates that polynucleotides and polypeptides corresponding to this
gene are useful for diagnosing and/or treating kidney diseases. Moreover, the protein
product of this gene could be used in the treatment and/or detection of kidney diseases
including renal failure, nephritis, renal tubular acidosis, proteinuria, pyuria, edema,
pyelonephritis, hydronephritis, nephrotic syndrome, crush syndrome,
glomerulonephritis, hematuria, renal colic and kidney stones, in addition to Wilms'
Tumor Disease, and congenital kidney abnormalities such as horseshoe kidney,
polycystic kidney, and Fanconi's syndrome. Alternatively, expression of this gene
product in the testis may implicate this gene product in normal testicular function. In
addition, this gene product may be useful in the treatment of male infertility, and/or
could be used as a male contraceptive. Moreover, conditions such as infertility and
reduced sperm count can be assessed using the invention to determine whether the
condition is associated with or caused by the occurrence of the gene or gene alteration.
Protein, as well as antibodies directed against the protein, may show utility as a tumor
marker and/or immunotherapy targets for the above listed tissues.

Many polynucleotide sequences, such as EST sequences, are publicly available
and accessible through sequence databases. Some of these sequences are related to SEQ
ID NO:38 and may have been publicly available prior to conception of the present
invention. Preferably, such related polynucleotides are specifically excluded from the
scope of the present invention. To list every related sequence is cumbersome.
Accordingly, preferably excluded from the present invention are one or more
polynucleotides comprising a nucleotide sequence described by the general formula of
a-b, where a is any integer between 1 to 1520 of SEQ ID NO:38, b is an integer of 15
to 1534, where both a and b correspond to the positions of nucleotide residues shown
in SEQ ID NO:38, and where b is greater than or equal to a + 14.

FEATURES OF PROTEIN ENCODED BY GENE NO: 29 

In specific embodiments, polypeptides of the invention comprise the following
amino acid sequence: ENFPETREVRAFSPRENLELCTCKS (SEQ ID NO: 173).
Polynucleotides encoding these polypeptides are also encompassed by the invention.

This gene is expressed primarily in K562 cells.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions which include, but are
not limited to, immune or hematopoietic diseases and/or conditions, particularly
leukemia. Similarly, polypeptides and antibodies directed to these polypeptides are
useful in providing immunological probes for differential identification of the tissue(s)
or cell type(s). For a number of disorders of the above tissues or cells, particularly of
the immune system, expression of this gene at significantly higher or lower levels may be
routinely detected in certain tissues or cell types (e.g., immune, hematopoietic, and
cancerous and wounded tissues) or bodily fluids (e.g., lymph, serum, plasma, urine,
synovial fluid and spinal fluid) or another tissue or cell sample taken from an individual
having such a disorder, relative to the standard gene expression level, i.e., the
expression level in healthy tissue or bodily fluid from an individual not having the
disorder.

The tissue distribution in K562 cells indicates that polynucleotides and
polypeptides corresponding to this gene are useful for diagnosing and/or treating
leukemia. The protein product of this gene is useful for the treatment and diagnosis of
hematopoietic related disorders such as anemia, pancytopenia, leukopenia,
thrombocytopenia or leukemia since stromal cells are important in the production of
cells of hematopoietic lineages. The uses include bone marrow cell ex vivo culture,
bone marrow transplantation, bone marrow reconstitution, radiotherapy or
chemotherapy of neoplasia. The gene product may also be involved in lymphopoiesis;
therefore, it can be used in immune disorders such as infection, inflammation, allergy,
immunodeficiency, etc. In addition, this gene product may have commercial utility in the
expansion of stem cells and committed progenitors of various blood lineages, and in the
differentiation and/or proliferation of various cell types. Protein, as well as antibodies
directed against the protein, may show utility as a tumor marker and/or immunotherapy
targets for the above listed tissues.

Many polynucleotide sequences, such as EST sequences, are publicly available
and accessible through sequence databases. Some of these sequences are related to SEQ
ID NO:39 and may have been publicly available prior to conception of the present
invention. Preferably, such related polynucleotides are specifically excluded from the
scope of the present invention. To list every related sequence is cumbersome.
Accordingly, preferably excluded from the present invention are one or more
polynucleotides comprising a nucleotide sequence described by the general formula of
a-b, where a is any integer between 1 to 1168 of SEQ ID NO:39, b is an integer of 15
to 1182, where both a and b correspond to the positions of nucleotide residues shown
in SEQ ID NO:39, and where b is greater than or equal to a + 14.

FEATURES OF PROTEIN ENCODED BY GENE NO: 30 

In specific embodiments, polypeptides of the invention comprise the following 
amino acid sequence: ALYCSPSLQID (SEQ ID NO: 174). Polynucleotides encoding 
these polypeptides are also encompassed by the invention. 

This gene is expressed primarily in activated T-cells.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions which include, but are
not limited to, immune and hematopoietic diseases and/or disorders. Similarly,
polypeptides and antibodies directed to these polypeptides are useful in providing
immunological probes for differential identification of the tissue(s) or cell type(s). For a
number of disorders of the above tissues or cells, particularly of the immune system,
expression of this gene at significantly higher or lower levels may be routinely detected
in certain tissues or cell types (e.g., immune, hematopoietic, and cancerous and
wounded tissues) or bodily fluids (e.g., lymph, serum, plasma, urine, synovial fluid
and spinal fluid) or another tissue or cell sample taken from an individual having such a
disorder, relative to the standard gene expression level, i.e., the expression level in
healthy tissue or bodily fluid from an individual not having the disorder.

The tissue distribution in activated T-cells indicates that polynucleotides and
polypeptides corresponding to this gene are useful for diagnosing and/or treating
immune disorders. Moreover, the expression of this gene product indicates a role in
regulating the proliferation; survival; differentiation; and/or activation of hematopoietic
cell lineages, including blood stem cells. This gene product may be involved in the
regulation of cytokine production, antigen presentation, or other processes suggesting a
usefulness in the treatment of cancer (e.g. by boosting immune responses).

Since the gene is expressed in cells of lymphoid origin, the natural gene product
may be involved in immune functions. Therefore it may be also used as an agent for
immunological disorders including arthritis, asthma, immunodeficiency diseases such
as AIDS, leukemia, rheumatoid arthritis, granulomatous disease, inflammatory bowel
disease, sepsis, acne, neutropenia, neutrophilia, psoriasis, hypersensitivities, such as
T-cell mediated cytotoxicity; immune reactions to transplanted organs and tissues, such
as host-versus-graft and graft-versus-host diseases, or autoimmunity disorders, such as
autoimmune infertility, lens tissue injury, demyelination, systemic lupus
erythematosus, drug induced hemolytic anemia, rheumatoid arthritis, Sjogren's disease,
scleroderma and tissues.

Moreover, the protein may represent a secreted factor that influences the
differentiation or behavior of other blood cells, or that recruits hematopoietic cells to
sites of injury. In addition, this gene product may have commercial utility in the
expansion of stem cells and committed progenitors of various blood lineages, and in the
differentiation and/or proliferation of various cell types. Protein, as well as antibodies
directed against the protein, may show utility as a tumor marker and/or immunotherapy
targets for the above listed tissues.

Many polynucleotide sequences, such as EST sequences, are publicly available
and accessible through sequence databases. Some of these sequences are related to SEQ
ID NO:40 and may have been publicly available prior to conception of the present
invention. Preferably, such related polynucleotides are specifically excluded from the
scope of the present invention. To list every related sequence is cumbersome.
Accordingly, preferably excluded from the present invention are one or more
polynucleotides comprising a nucleotide sequence described by the general formula of
a-b, where a is any integer between 1 to 1827 of SEQ ID NO:40, b is an integer of 15
to 1841, where both a and b correspond to the positions of nucleotide residues shown
in SEQ ID NO:40, and where b is greater than or equal to a + 14.

WO 99/46289 PCT/US99/05721

FEATURES OF PROTEIN ENCODED BY GENE NO: 31 

The translation product of this gene was shown to have homology to the human
AF-6 gene product (See Genbank Accession No. gnl|PID|d1033446 (AB011399)),
which is thought to be important in the predisposition of acute myeloid leukemia.

In specific embodiments, polypeptides of the invention comprise the following 
amino acid sequence: CHCSMLKSHGDVQNVLTLFVTVLSDVSYLQQIQKKLR 
(SEQ ID NO: 175); and/or CYFHQKAQSNGPEKQEKEGVIQNFKRTLSKKEK 
KEKKKK (SEQ ID NO: 176). Polynucleotides encoding these polypeptides are also 
encompassed by the invention. The gene encoding the disclosed cDNA is believed to 
reside on chromosome 6. Accordingly, polynucleotides related to this invention are 
useful as a marker in linkage analysis for chromosome 6.

This gene is expressed primarily in Merkel cells.

Therefore, polynucleotides and polypeptides of the invention are useful as
reagents for differential identification of the tissue(s) or cell type(s) present in a
biological sample and for diagnosis of diseases and conditions which include, but are
not limited to, immune and hematopoietic disorders and/or diseases, particularly
leukemias. Similarly, polypeptides and antibodies directed to these polypeptides are
useful in providing immunological probes for differential identification of the tissue(s)
or cell type(s). For a number of disorders of the above tissues or cells, particularly of
the immune system, expression of this gene at significantly higher or lower levels may
be routinely detected in certain tissues or cell types (e.g., immune, hematopoietic,
leukemic, and cancerous and wounded tissues) or bodily fluids (e.g., lymph, serum,
plasma, urine, synovial fluid and spinal fluid) or another tissue or cell sample taken
from an individual having such a disorder, relative to the standard gene expression
level, i.e., the expression level in healthy tissue or bodily fluid from an individual not
having the disorder.

The tissue distribution in Merkel cells, combined with the homology to the AF-6
gene, indicates that polynucleotides and polypeptides corresponding to this gene are
useful for diagnosing and/or treating immune disorders. The protein product of this
gene is useful for the treatment and/or diagnosis of hematopoietic related disorders such
as anemia, pancytopenia, leukopenia, thrombocytopenia or leukemia since stromal cells
are important in the production of cells of hematopoietic lineages. The uses include
bone marrow cell ex vivo culture, bone marrow transplantation, bone marrow
reconstitution, radiotherapy or chemotherapy of neoplasia. The gene product may also
be involved in lymphopoiesis; therefore, it can be used in immune disorders such as
infection, inflammation, allergy, immunodeficiency, etc. In addition, this gene product
may have commercial utility in the expansion of stem cells and committed progenitors
of various blood lineages, and in the differentiation and/or proliferation of various cell
types. Protein, as well as antibodies directed against the protein, may show utility as a
tumor marker and/or immunotherapy targets for the above listed tissues.

Many polynucleotide sequences, such as EST sequences, are publicly available
and accessible through sequence databases. Some of these sequences are related to SEQ
ID NO:41 and may have been publicly available prior to conception of the present
invention. Preferably, such related polynucleotides are specifically excluded from the
scope of the present invention. To list every related sequence is cumbersome.
Accordingly, preferably excluded from the present invention are one or more
polynucleotides comprising a nucleotide sequence described by the general formula of
a-b, where a is any integer between 1 to 1183 of SEQ ID NO:41, b is an integer of 15
to 1197, where both a and b correspond to the positions of nucleotide residues shown
in SEQ ID NO:41, and where b is greater than or equal to a + 14.

Table 1 summarizes the information corresponding to each "Gene No."
described above. The nucleotide sequence identified as "NT SEQ ID NO:X" was
assembled from partially homologous ("overlapping") sequences obtained from the
"cDNA clone ID" identified in Table 1 and, in some cases, from additional related DNA
clones. The overlapping sequences were assembled into a single contiguous sequence
of high redundancy (usually three to five overlapping sequences at each nucleotide
position), resulting in a final sequence identified as SEQ ID NO:X.

The cDNA Clone ID was deposited on the date and given the corresponding
deposit number listed in "ATCC Deposit No:Z and Date." Some of the deposits contain
multiple different clones corresponding to the same gene. "Vector" refers to the type of
vector contained in the cDNA Clone ID.

"Total NT Seq." refers to the total number of nucleotides in the contig identified
by "Gene No." The deposited clone may contain all or most of these sequences,
reflected by the nucleotide position indicated as "5' NT of Clone Seq." and the "3' NT
of Clone Seq." of SEQ ID NO:X. The nucleotide position of SEQ ID NO:X of the
putative start codon (methionine) is identified as "5' NT of Start Codon." Similarly,
the nucleotide position of SEQ ID NO:X of the predicted signal sequence is identified as
"5' NT of First AA of Signal Pep."

The translated amino acid sequence, beginning with the methionine, is identified
as "AA SEQ ID NO:Y," although other reading frames can also be easily translated
using known molecular biology techniques. The polypeptides produced by these
alternative open reading frames are specifically contemplated by the present invention.

The first and last amino acid position of SEQ ID NO:Y of the predicted signal
peptide is identified as "First AA of Sig Pep" and "Last AA of Sig Pep." The predicted
first amino acid position of SEQ ID NO:Y of the secreted portion is identified as
"Predicted First AA of Secreted Portion." Finally, the amino acid position of SEQ ID
NO:Y of the last amino acid in the open reading frame is identified as "Last AA of
ORF."

SEQ ID NO:X and the translated SEQ ID NO:Y are sufficiently accurate and
otherwise suitable for a variety of uses well known in the art and described further
below. For instance, SEQ ID NO:X is useful for designing nucleic acid hybridization
probes that will detect nucleic acid sequences contained in SEQ ID NO:X or the cDNA
contained in the deposited clone. These probes will also hybridize to nucleic acid
molecules in biological samples, thereby enabling a variety of forensic and diagnostic
methods of the invention. Similarly, polypeptides identified from SEQ ID NO:Y may
be used to generate antibodies which bind specifically to the secreted proteins encoded
by the cDNA clones identified in Table 1.

Nevertheless, DNA sequences generated by sequencing reactions can contain
sequencing errors. The errors exist as misidentified nucleotides, or as insertions or
deletions of nucleotides in the generated DNA sequence. The erroneously inserted or
deleted nucleotides cause frame shifts in the reading frames of the predicted amino acid
sequence. In these cases, the predicted amino acid sequence diverges from the actual
amino acid sequence, even though the generated DNA sequence may be greater than
99.9% identical to the actual DNA sequence (for example, one base insertion or deletion
in an open reading frame of over 1000 bases).
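
The effect of a single inserted base can be sketched in a few lines of Python. This is an illustration under stated assumptions, not a method from the disclosure: it uses the standard genetic code, a made-up toy reading frame, and the hypothetical helper names translate, insert_base, and approx_identity_one_indel. The identity figure is a crude estimate (unchanged bases divided by alignment length), not the output of any alignment program.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate frame 1 of a DNA string; a trailing partial codon is ignored."""
    codons = (dna[i:i + 3] for i in range(0, len(dna) - len(dna) % 3, 3))
    return "".join(CODON_TABLE.get(codon, "X") for codon in codons)


def insert_base(dna: str, index: int, base: str) -> str:
    """Return dna with one base inserted before the 0-based index."""
    return dna[:index] + base + dna[index:]


def approx_identity_one_indel(length: int) -> float:
    """Crude identity estimate for a sequence of this length with one inserted base."""
    return length / (length + 1)
```

With the toy frame "ATG" followed by five "GCT" codons, translate gives "MAAAAA"; inserting a single "A" after the start codon changes every downstream residue and gives "MSCCCC". For a frame of 1203 bases the crude identity estimate is still above 99.9%, which is the situation the paragraph above describes.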

Accordingly, for those applications requiring precision in the nucleotide
sequence or the amino acid sequence, the present invention provides not only the
generated nucleotide sequence identified as SEQ ID NO:X and the predicted translated
amino acid sequence identified as SEQ ID NO:Y, but also a sample of plasmid DNA
containing a human cDNA of the invention deposited with the ATCC, as set forth in
Table 1. The nucleotide sequence of each deposited clone can readily be determined by
sequencing the deposited clone in accordance with known methods. The predicted
amino acid sequence can then be verified from such deposits. Moreover, the amino
acid sequence of the protein encoded by a particular clone can also be directly
determined by peptide sequencing or by expressing the protein in a suitable host cell
containing the deposited human cDNA, collecting the protein, and determining its
sequence.

The present invention also relates to the genes corresponding to SEQ ID NO:X,
SEQ ID NO:Y, or the deposited clone. The corresponding gene can be isolated in
accordance with known methods using the sequence information disclosed herein.
Such methods include preparing probes or primers from the disclosed sequence and
identifying or amplifying the corresponding gene from appropriate sources of genomic
material.

Also provided in the present invention are species homologs. Species
homologs may be isolated and identified by making suitable probes or primers from the
sequences provided herein and screening a suitable nucleic acid source for the desired
homolog.

The polypeptides of the invention can be prepared in any suitable manner. Such 
polypeptides include isolated naturally occurring polypeptides, recombinantly produced 
polypeptides, synthetically produced polypeptides, or polypeptides produced by a 
combination of these methods. Means for preparing such polypeptides are well 
understood in the art.

The polypeptides may be in the form of the secreted protein, including the
mature form, or may be a part of a larger protein, such as a fusion protein (see below).
It is often advantageous to include an additional amino acid sequence which contains
secretory or leader sequences, pro-sequences, sequences which aid in purification,
such as multiple histidine residues, or an additional sequence for stability during
recombinant production.

The polypeptides of the present invention are preferably provided in an isolated
form, and preferably are substantially purified. A recombinantly produced version of a
polypeptide, including the secreted polypeptide, can be substantially purified by the
one-step method described in Smith and Johnson, Gene 67:31-40 (1988).
Polypeptides of the invention also can be purified from natural or recombinant sources
using antibodies of the invention raised against the secreted protein in methods which
are well known in the art.

Signal Sequences

Methods for predicting whether a protein has a signal sequence, as well as the
cleavage point for that sequence, are available. For instance, the method of McGeoch,
Virus Res. 3:271-286 (1985), uses the information from a short N-terminal charged
region and a subsequent uncharged region of the complete (uncleaved) protein. The
method of von Heijne, Nucleic Acids Res. 14:4683-4690 (1986), uses the information
from the residues surrounding the cleavage site, typically residues -13 to +2, where +1
indicates the amino terminus of the secreted protein. The accuracy of predicting the
cleavage points of known mammalian secretory proteins for each of these methods is in
the range of 75-80% (von Heijne, supra). However, the two methods do not always
produce the same predicted cleavage point(s) for a given protein.

In the present case, the deduced amino acid sequence of the secreted polypeptide
was analyzed by a computer program called SignalP (Henrik Nielsen et al., Protein
Engineering 10:1-6 (1997)), which predicts the cellular location of a protein based on
the amino acid sequence. As part of this computational prediction of localization, the
methods of McGeoch and von Heijne are incorporated. The analysis of the amino acid
sequences of the secreted proteins described herein by this program provided the results
shown in Table 1.

As one of ordinary skill would appreciate, however, cleavage sites sometimes
vary from organism to organism and cannot be predicted with absolute certainty.
Accordingly, the present invention provides secreted polypeptides having a sequence
shown in SEQ ID NO:Y which have an N-terminus beginning within 5 residues (i.e., +
or - 5 residues) of the predicted cleavage point. Similarly, it is also recognized that in
some cases, cleavage of the signal sequence from a secreted protein is not entirely
uniform, resulting in more than one secreted species. These polypeptides, and the
polynucleotides encoding such polypeptides, are contemplated by the present invention.
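
As a rough illustration of the "within 5 residues" window, the following Python sketch lists the candidate secreted forms around a predicted first residue. It is a sketch only: the function name candidate_n_termini is hypothetical, positions are assumed to be 1-based as in Table 1, and the window is simply clipped to the ends of the sequence.

```python
from typing import List, Tuple


def candidate_n_termini(protein: str, predicted_first_aa: int,
                        window: int = 5) -> List[Tuple[int, str]]:
    """List (start position, sequence) for each N-terminus within the window.

    protein            full translated sequence, including the signal peptide
    predicted_first_aa 1-based position of the predicted first secreted residue
    window             number of residues allowed on either side (5 in the text)
    """
    first = max(1, predicted_first_aa - window)
    last = min(len(protein), predicted_first_aa + window)
    # Position p (1-based) corresponds to Python index p - 1.
    return [(pos, protein[pos - 1:]) for pos in range(first, last + 1)]
```

For a 24-residue sequence with a predicted first secreted residue at position 15, the function returns eleven candidates starting at positions 10 through 20; near the start of a sequence the window is truncated rather than extended.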

Moreover, the signal sequence identified by the above analysis may not
necessarily predict the naturally occurring signal sequence. For example, the naturally
occurring signal sequence may be further upstream from the predicted signal sequence.
However, it is likely that the predicted signal sequence will be capable of directing the
secreted protein to the ER. These polypeptides, and the polynucleotides encoding such
polypeptides, are contemplated by the present invention.

Polynucleotide and Polypeptide Variants

"Variant" refers to a polynucleotide or polypeptide differing from the 
polynucleotide or polypeptide of the present invention, but retaining essential properties 
thereof. Generally, variants are overall closely similar, and, in many regions, identical 
to the polynucleotide or polypeptide of the present invention. 

By a polynucleotide having a nucleotide sequence at least, for example, 95%
"identical" to a reference nucleotide sequence of the present invention, it is intended that
the nucleotide sequence of the polynucleotide is identical to the reference sequence
except that the polynucleotide sequence may include up to five point mutations per each
100 nucleotides of the reference nucleotide sequence encoding the polypeptide. In other
words, to obtain a polynucleotide having a nucleotide sequence at least 95% identical to
a reference nucleotide sequence, up to 5% of the nucleotides in the reference sequence
may be deleted or substituted with another nucleotide, or a number of nucleotides up to
5% of the total nucleotides in the reference sequence may be inserted into the reference
sequence. The query sequence may be an entire sequence shown in Table 1, the ORF
(open reading frame), or any fragment specified as described herein.
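
The arithmetic behind "up to five point mutations per each 100 nucleotides" can be sketched in Python as follows. The function name and the use of whole-number percentages are assumptions for illustration only; the sketch simply counts how many positions may differ before a sequence of a given length falls below the stated threshold.

```python
def max_point_mutations(reference_length, percent_identity):
    """Largest whole number of altered positions that still satisfies
    'at least percent_identity % identical' for the given length."""
    if not 0 <= percent_identity <= 100:
        raise ValueError("percent_identity must be between 0 and 100")
    if reference_length < 0:
        raise ValueError("reference_length must be non-negative")
    # Integer arithmetic avoids floating-point rounding at the boundary.
    return reference_length * (100 - percent_identity) // 100


if __name__ == "__main__":
    print(max_point_mutations(100, 95))
    print(max_point_mutations(300, 95))
```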

As a practical matter, whether any particular nucleic acid molecule or
polypeptide is at least 90%, 95%, 96%, 97%, 98% or 99% identical to a nucleotide
sequence of the present invention can be determined conventionally using known
computer programs. A preferred method for determining the best overall match between
a query sequence (a sequence of the present invention) and a subject sequence, also
referred to as a global sequence alignment, can be determined using the FASTDB
computer program based on the algorithm of Brutlag et al. (Comp. App. Biosci. (1990)
6:237-245). In a sequence alignment the query and subject sequences are both DNA
sequences. An RNA sequence can be compared by converting U's to T's. The result
of said global sequence alignment is in percent identity. Preferred parameters used in a
FASTDB alignment of DNA sequences to calculate percent identity are:
Matrix=Unitary, k-tuple=4, Mismatch Penalty=1, Joining Penalty=30, Randomization
Group Length=0, Cutoff Score=1, Gap Penalty=5, Gap Size Penalty=0.05, Window
Size=500 or the length of the subject nucleotide sequence, whichever is shorter.

If the subject sequence is shorter than the query sequence because of 5' or 3'
deletions, not because of internal deletions, a manual correction must be made to the
results. This is because the FASTDB program does not account for 5' and 3'
truncations of the subject sequence when calculating percent identity. For subject
sequences truncated at the 5' or 3' ends, relative to the query sequence, the percent
identity is corrected by calculating the number of bases of the query sequence that are 5'
and 3' of the subject sequence, which are not matched/aligned, as a percent of the total
bases of the query sequence. Whether a nucleotide is matched/aligned is determined by
results of the FASTDB sequence alignment. This percentage is then subtracted from
the percent identity, calculated by the above FASTDB program using the specified
parameters, to arrive at a final percent identity score. This corrected score is what is
used for the purposes of the present invention. Only bases outside the 5' and 3' bases
of the subject sequence, as displayed by the FASTDB alignment, which are not
matched/aligned with the query sequence, are calculated for the purposes of manually
adjusting the percent identity score.

For example, a 90 base subject sequence is aligned to a 100 base query
sequence to determine percent identity. The deletions occur at the 5' end of the subject
sequence and therefore, the FASTDB alignment does not show a matching/alignment of
the first 10 bases at the 5' end. The 10 unpaired bases represent 10% of the sequence
(number of bases at the 5' and 3' ends not matched/total number of bases in the query
sequence) so 10% is subtracted from the percent identity score calculated by the
FASTDB program. If the remaining 90 bases were perfectly matched the final percent
identity would be 90%. In another example, a 90 base subject sequence is compared
with a 100 base query sequence. This time the deletions are internal deletions so that
there are no bases on the 5' or 3' of the subject sequence which are not matched/aligned
with the query. In this case the percent identity calculated by FASTDB is not manually
corrected. Once again, only bases 5' and 3' of the subject sequence which are not
matched/aligned with the query sequence are manually corrected for. No other manual
corrections are to be made for the purposes of the present invention.
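
The manual correction described above reduces to a single subtraction, which the following Python sketch reproduces. It does not run FASTDB; the function name and its arguments are hypothetical, and the identity reported by the alignment program is simply passed in as a number. The same arithmetic applies to the N- and C-terminal correction for amino acid sequences.

```python
def corrected_percent_identity(reported_identity, query_length,
                               unmatched_5prime=0, unmatched_3prime=0):
    """Subtract the share of query positions that lie outside the ends of
    the subject sequence and are therefore not matched/aligned.

    reported_identity: percent identity reported by the alignment program.
    unmatched_5prime / unmatched_3prime: counts of unaligned query positions
    beyond the 5' and 3' (or N- and C-terminal) ends of the subject.
    """
    if query_length <= 0:
        raise ValueError("query_length must be positive")
    overhang = unmatched_5prime + unmatched_3prime
    if overhang < 0 or overhang > query_length:
        raise ValueError("overhang must be between 0 and query_length")
    penalty = 100.0 * overhang / query_length
    return reported_identity - penalty


if __name__ == "__main__":
    # 90-base subject, 100-base query, 10 bases missing at the 5' end.
    print(corrected_percent_identity(100.0, 100, unmatched_5prime=10))
    # Internal deletions only: no terminal overhang, so no correction.
    print(corrected_percent_identity(90.0, 100))
```

The first call mirrors the worked example with a terminal truncation, and the second mirrors the internal-deletion example in which the reported score is left unchanged.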

By a polypeptide having an amino acid sequence at least, for example, 95%
"identical" to a query amino acid sequence of the present invention, it is intended that
the amino acid sequence of the subject polypeptide is identical to the query sequence
except that the subject polypeptide sequence may include up to five amino acid
alterations per each 100 amino acids of the query amino acid sequence. In other words,
to obtain a polypeptide having an amino acid sequence at least 95% identical to a query
amino acid sequence, up to 5% of the amino acid residues in the subject sequence may
be inserted, deleted (indels), or substituted with another amino acid. These alterations
of the reference sequence may occur at the amino or carboxy terminal positions of the
reference amino acid sequence or anywhere between those terminal positions,
interspersed either individually among residues in the reference sequence or in one or
more contiguous groups within the reference sequence.

As a practical matter, whether any particular polypeptide is at least 90%, 95%,
96%, 97%, 98% or 99% identical to, for instance, the amino acid sequences shown in
Table 1 or to the amino acid sequence encoded by the deposited DNA clone can be
determined conventionally using known computer programs. A preferred method for
determining the best overall match between a query sequence (a sequence of the present
invention) and a subject sequence, also referred to as a global sequence alignment, can
be determined using the FASTDB computer program based on the algorithm of Brutlag
et al. (Comp. App. Biosci. (1990) 6:237-245). In a sequence alignment the query and
subject sequences are either both nucleotide sequences or both amino acid sequences.
The result of said global sequence alignment is in percent identity. Preferred parameters
used in a FASTDB amino acid alignment are: Matrix=PAM 0, k-tuple=2, Mismatch
Penalty=1, Joining Penalty=20, Randomization Group Length=0, Cutoff Score=1,
Window Size=sequence length, Gap Penalty=5, Gap Size Penalty=0.05, Window
Size=500 or the length of the subject amino acid sequence, whichever is shorter.

If the subject sequence is shorter than the query sequence due to N- or C-
terminal deletions, not because of internal deletions, a manual correction must be made
to the results. This is because the FASTDB program does not account for N- and C-
terminal truncations of the subject sequence when calculating global percent identity.
For subject sequences truncated at the N- and C-termini, relative to the query
sequence, the percent identity is corrected by calculating the number of residues of the
query sequence that are N- and C-terminal of the subject sequence, which are not
matched/aligned with a corresponding subject residue, as a percent of the total residues of
the query sequence. Whether a residue is matched/aligned is determined by results of
the FASTDB sequence alignment. This percentage is then subtracted from the percent
identity, calculated by the above FASTDB program using the specified parameters, to
arrive at a final percent identity score. This final percent identity score is what is used
for the purposes of the present invention. Only residues to the N- and C-termini of the
subject sequence, which are not matched/aligned with the query sequence, are
considered for the purposes of manually adjusting the percent identity score. That is,
only query residue positions outside the farthest N- and C-terminal residues of the
subject sequence are considered.



For example, a 90 amino acid residue subject sequence is aligned with a 100
residue query sequence to determine percent identity. The deletion occurs at the N-
terminus of the subject sequence and therefore, the FASTDB alignment does not show
a matching/alignment of the first 10 residues at the N-terminus. The 10 unpaired
residues represent 10% of the sequence (number of residues at the N- and C-termini
not matched/total number of residues in the query sequence) so 10% is subtracted from
the percent identity score calculated by the FASTDB program. If the remaining 90
residues were perfectly matched the final percent identity would be 90%. In another
example, a 90 residue subject sequence is compared with a 100 residue query sequence.
This time the deletions are internal deletions so there are no residues at the N- or C-
termini of the subject sequence which are not matched/aligned with the query. In this
case the percent identity calculated by FASTDB is not manually corrected. Once again,
only residue positions outside the N- and C-terminal ends of the subject sequence, as
displayed in the FASTDB alignment, which are not matched/aligned with the query
sequence are manually corrected for. No other manual corrections are to be made for the
purposes of the present invention.

The variants may contain alterations in the coding regions, non-coding regions,
or both. Especially preferred are polynucleotide variants containing alterations which
produce silent substitutions, additions, or deletions, but do not alter the properties or
activities of the encoded polypeptide. Nucleotide variants produced by silent
substitutions due to the degeneracy of the genetic code are preferred. Moreover,
variants in which 5-10, 1-5, or 1-2 amino acids are substituted, deleted, or added in any
combination are also preferred. Polynucleotide variants can be produced for a variety
of reasons, e.g., to optimize codon expression for a particular host (change codons in
the human mRNA to those preferred by a bacterial host such as E. coli).
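
A silent, host-directed codon change of the kind just mentioned can be sketched in Python. The standard genetic code table below is built from the usual compact TCAG ordering; the `PREFERRED` mapping, however, is a purely hypothetical and incomplete set of "host-preferred" codons chosen for illustration, not a measured E. coli codon usage table, and the function names are likewise assumptions.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = ("FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRR"
               "IIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(codon): aa
               for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}

# Hypothetical preferred codons for an illustrative host (assumption).
PREFERRED = {"L": "CTG", "R": "CGT", "P": "CCG", "G": "GGC", "A": "GCG"}


def _codons(seq):
    dna = seq.upper().replace("U", "T")
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return [dna[i:i + 3] for i in range(0, usable, 3)]


def translate(seq):
    """Translate a coding sequence; '*' marks a termination codon."""
    return "".join(CODON_TABLE[c] for c in _codons(seq))


def optimize_codons(seq):
    """Swap each codon for the preferred synonym, if one is listed."""
    return "".join(PREFERRED.get(CODON_TABLE[c], c) for c in _codons(seq))


if __name__ == "__main__":
    original = "ATGCTAAGACCAGGAGCATAA"
    optimized = optimize_codons(original)
    print(optimized, translate(optimized) == translate(original))
```

Because every replacement codon encodes the same amino acid, the encoded polypeptide is unchanged, which is what makes the substitution silent.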

Naturally occurring variants are called "allelic variants," and refer to one of
several alternate forms of a gene occupying a given locus on a chromosome of an
organism. (Genes II, Lewin, B., ed., John Wiley & Sons, New York (1985).) These
allelic variants can vary at either the polynucleotide and/or polypeptide level.
Alternatively, non-naturally occurring variants may be produced by mutagenesis
techniques or by direct synthesis.

Using known methods of protein engineering and recombinant DNA
technology, variants may be generated to improve or alter the characteristics of the
polypeptides of the present invention. For instance, one or more amino acids can be
deleted from the N-terminus or C-terminus of the secreted protein without substantial
loss of biological function. The authors of Ron et al., J. Biol. Chem. 268:2984-2988
(1993), reported variant KGF proteins having heparin binding activity even after
deleting 3, 8, or 27 amino-terminal amino acid residues. Similarly, Interferon gamma
exhibited up to ten times higher activity after deleting 8-10 amino acid residues from the
carboxy terminus of this protein. (Dobeli et al., J. Biotechnology 7:199-216 (1988).)

Moreover, ample evidence demonstrates that variants often retain a biological
activity similar to that of the naturally occurring protein. For example, Gayle and
coworkers (J. Biol. Chem. 268:22105-22111 (1993)) conducted extensive mutational
analysis of human cytokine IL-1a. They used random mutagenesis to generate over
3,500 individual IL-1a mutants that averaged 2.5 amino acid changes per variant over
the entire length of the molecule. Multiple mutations were examined at every possible
amino acid position. The investigators found that "[m]ost of the molecule could be
altered with little effect on either [binding or biological activity]." (See, Abstract.) In
fact, only 23 unique amino acid sequences, out of more than 3,500 nucleotide
sequences examined, produced a protein that significantly differed in activity from wild-
type.

Furthermore, even if deleting one or more amino acids from the N-terminus or 
C-terminus of a polypeptide results in modification or loss of one or more biological 
functions, other biological activities may still be retained. For example, the ability of a 
deletion variant to induce and/or to bind antibodies which recognize the secreted form 
will likely be retained when less than the majority of the residues of the secreted form 
are removed from the N-terminus or C-terminus. Whether a particular polypeptide
lacking N- or C-terminal residues of a protein retains such immunogenic activities can 
readily be determined by routine methods described herein and otherwise known in the 
art. 

Thus, the invention further includes polypeptide variants which show
substantial biological activity. Such variants include deletions, insertions, inversions,
repeats, and substitutions selected according to general rules known in the art so as to
have little effect on activity. For example, guidance concerning how to make
phenotypically silent amino acid substitutions is provided in Bowie, J. U. et al.,
Science 247:1306-1310 (1990), wherein the authors indicate that there are two main
strategies for studying the tolerance of an amino acid sequence to change.

The first strategy exploits the tolerance of amino acid substitutions by natural
selection during the process of evolution. By comparing amino acid sequences in
different species, conserved amino acids can be identified. These conserved amino
acids are likely important for protein function. In contrast, the amino acid positions
where substitutions have been tolerated by natural selection indicate that these
positions are not critical for protein function. Thus, positions tolerating amino acid
substitution could be modified while still maintaining biological activity of the protein.
The second strategy uses genetic engineering to introduce amino acid changes at
specific positions of a cloned gene to identify regions critical for protein function. For
example, site directed mutagenesis or alanine-scanning mutagenesis (introduction of
single alanine mutations at every residue in the molecule) can be used. (Cunningham
and Wells, Science 244:1081-1085 (1989).) The resulting mutant molecules can then
be tested for biological activity.
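
The set of single-alanine mutants that an alanine scan would call for can be listed with a few lines of Python. This is only a sketch of the bookkeeping, not a laboratory protocol; the function name, the one-letter residue encoding, and the choice to skip positions that are already alanine are assumptions made for illustration.

```python
def alanine_scan(protein):
    """Yield (label, mutant_sequence) for each single alanine substitution.

    Labels follow the common form <original residue><position>A, with
    1-based positions. Residues that are already alanine are skipped.
    """
    for index, residue in enumerate(protein):
        if residue == "A":
            continue
        mutant = protein[:index] + "A" + protein[index + 1:]
        yield f"{residue}{index + 1}A", mutant


if __name__ == "__main__":
    for label, mutant in alanine_scan("MKAL"):
        print(label, mutant)
```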

As the authors state, these two strategies have revealed that proteins are
surprisingly tolerant of amino acid substitutions. The authors further indicate which
amino acid changes are likely to be permissive at certain amino acid positions in the
protein. For example, most buried (within the tertiary structure of the protein) amino
acid residues require nonpolar side chains, whereas few features of surface side chains
are generally conserved. Moreover, tolerated conservative amino acid substitutions
involve replacement of the aliphatic or hydrophobic amino acids Ala, Val, Leu and Ile;
replacement of the hydroxyl residues Ser and Thr; replacement of the acidic residues
Asp and Glu; replacement of the amide residues Asn and Gln; replacement of the basic
residues Lys, Arg, and His; replacement of the aromatic residues Phe, Tyr, and Trp;
and replacement of the small-sized amino acids Ala, Ser, Thr, Met, and Gly.
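
The conservative substitution groups listed above can be encoded as a simple lookup, sketched here in Python. The one-letter codes and the group names are a transcription of the groups in the text; the function name and the rule that a substitution counts as conservative when both residues share at least one group are assumptions for illustration.

```python
# Groups transcribed from the text, in one-letter amino acid codes.
CONSERVATIVE_GROUPS = {
    "aliphatic/hydrophobic": set("AVLI"),
    "hydroxyl": set("ST"),
    "acidic": set("DE"),
    "amide": set("NQ"),
    "basic": set("KRH"),
    "aromatic": set("FYW"),
    "small": set("ASTMG"),
}


def is_conservative(original, replacement):
    """True if the two different residues share at least one group."""
    if original == replacement:
        return False  # not a substitution at all
    return any(original in group and replacement in group
               for group in CONSERVATIVE_GROUPS.values())


if __name__ == "__main__":
    for pair in [("L", "I"), ("D", "E"), ("K", "D"), ("A", "G")]:
        print(pair, is_conservative(*pair))
```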

Besides conservative amino acid substitution, variants of the present invention
include (i) substitutions with one or more of the non-conserved amino acid residues,
where the substituted amino acid residues may or may not be one encoded by the
genetic code, or (ii) substitution with one or more of amino acid residues having a
substituent group, or (iii) fusion of the mature polypeptide with another compound,
such as a compound to increase the stability and/or solubility of the polypeptide (for
example, polyethylene glycol), or (iv) fusion of the polypeptide with additional amino
acids, such as an IgG Fc fusion region peptide, or leader or secretory sequence, or a
sequence facilitating purification. Such variant polypeptides are deemed to be within
the scope of those skilled in the art from the teachings herein.

For example, polypeptide variants containing amino acid substitutions of
charged amino acids with other charged or neutral amino acids may produce proteins
with improved characteristics, such as less aggregation. Aggregation of pharmaceutical
formulations both reduces activity and increases clearance due to the aggregate's
immunogenic activity. (Pinckard et al., Clin. Exp. Immunol. 2:331-340 (1967);
Robbins et al., Diabetes 36:838-845 (1987); Cleland et al., Crit. Rev. Therapeutic
Drug Carrier Systems 10:307-377 (1993).)

A further embodiment of the invention relates to a polypeptide which
comprises the amino acid sequence of the present invention having an amino acid
sequence which contains at least one amino acid substitution, but not more than 50
amino acid substitutions, even more preferably, not more than 40 amino acid
substitutions, still more preferably, not more than 30 amino acid substitutions, and
still even more preferably, not more than 20 amino acid substitutions. Of course, in
order of ever-increasing preference, it is highly preferable for a polypeptide to have an
amino acid sequence which comprises the amino acid sequence of the present
invention, which contains at least one, but not more than 10, 9, 8, 7, 6, 5, 4, 3, 2 or 1
amino acid substitutions. In specific embodiments, the number of additions,
substitutions, and/or deletions in the amino acid sequence of the present invention or
fragments thereof (e.g., the mature form and/or other fragments described herein), is
1-5, 5-10, 5-25, 5-50, 10-50 or 50-150; conservative amino acid substitutions are
preferable.






Polynucleotide and Polypeptide Fragments

In the present invention, a "polynucleotide fragment" refers to a short
polynucleotide having a nucleic acid sequence contained in the deposited clone or
shown in SEQ ID NO:X. The short nucleotide fragments are preferably at least about
15 nt, and more preferably at least about 20 nt, still more preferably at least about 30 nt,
and even more preferably, at least about 40 nt in length. A fragment "at least 20 nt in
length," for example, is intended to include 20 or more contiguous bases from the
cDNA sequence contained in the deposited clone or the nucleotide sequence shown in
SEQ ID NO:X. These nucleotide fragments are useful as diagnostic probes and primers
as discussed herein. Of course, larger fragments (e.g., 50, 150, 500, 600, 2000
nucleotides) are preferred.

Moreover, representative examples of polynucleotide fragments of the
invention include, for example, fragments having a sequence from about nucleotide
number 1-50, 51-100, 101-150, 151-200, 201-250, 251-300, 301-350, 351-400, 401-
450, 451-500, 501-550, 551-600, 651-700, 701-750, 751-800, 800-850, 851-900,
901-950, 951-1000, 1001-1050, 1051-1100, 1101-1150, 1151-1200, 1201-1250,
1251-1300, 1301-1350, 1351-1400, 1401-1450, 1451-1500, 1501-1550, 1551-1600,
1601-1650, 1651-1700, 1701-1750, 1751-1800, 1801-1850, 1851-1900, 1901-1950,
1951-2000, or 2001 to the end of SEQ ID NO:X or the cDNA contained in the
deposited clone. In this context "about" includes the particularly recited ranges, larger
or smaller by several (5, 4, 3, 2, or 1) nucleotides, at either terminus or at both termini.
Preferably, these fragments encode a polypeptide which has biological activity. More
preferably, these polynucleotides can be used as probes or primers as discussed herein.

In the present invention, a "polypeptide fragment" refers to a short amino acid
sequence contained in SEQ ID NO:Y or encoded by the cDNA contained in the
deposited clone. Protein fragments may be "free-standing," or comprised within a
larger polypeptide of which the fragment forms a part or region, most preferably as a
single continuous region. Representative examples of polypeptide fragments of the
invention include, for example, fragments from about amino acid number 1-20, 21-40,
41-60, 61-80, 81-100, 101-120, 121-140, 141-160, or 161 to the end of the coding
region. Moreover, polypeptide fragments can be about 20, 30, 40, 50, 60, 70, 80, 90,
100, 110, 120, 130, 140, or 150 amino acids in length. In this context "about"
includes the particularly recited ranges, larger or smaller by several (5, 4, 3, 2, or 1)
amino acids, at either extreme or at both extremes.
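
The representative fragment ranges recited above follow a regular tiling (50 nucleotides or 20 amino acids per window), and the term "about" allows each boundary to shift by up to five positions. A minimal Python sketch of both ideas follows; the function names and the treatment of "about" as a tolerance of at most `slack` positions at each end are assumptions for illustration.

```python
def fragment_ranges(seq_length, window):
    """Return 1-based inclusive (start, end) ranges tiling a sequence,
    e.g. 1-50, 51-100, ... with the last range running to the end."""
    return [(start, min(start + window - 1, seq_length))
            for start in range(1, seq_length + 1, window)]


def within_about(candidate, recited, slack=5):
    """True if both ends of candidate lie within `slack` positions of the
    corresponding ends of the recited range."""
    return (abs(candidate[0] - recited[0]) <= slack
            and abs(candidate[1] - recited[1]) <= slack)


if __name__ == "__main__":
    print(fragment_ranges(120, 50))
    print(within_about((3, 52), (1, 50)))
```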

Preferred polypeptide fragments include the secreted protein as well as the
mature form. Further preferred polypeptide fragments include the secreted protein or
the mature form having a continuous series of deleted residues from the amino or the
carboxy terminus, or both. For example, any number of amino acids, ranging from 1-
60, can be deleted from the amino terminus of either the secreted polypeptide or the
mature form. Similarly, any number of amino acids, ranging from 1-30, can be deleted
from the carboxy terminus of the secreted protein or mature form. Furthermore, any
combination of the above amino and carboxy terminus deletions is preferred.
Similarly, polynucleotide fragments encoding these polypeptide fragments are also
preferred.

Also preferred are polypeptide and polynucleotide fragments characterized by
structural or functional domains, such as fragments that comprise alpha-helix and alpha-
helix forming regions, beta-sheet and beta-sheet-forming regions, turn and turn-
forming regions, coil and coil-forming regions, hydrophilic regions, hydrophobic
regions, alpha amphipathic regions, beta amphipathic regions, flexible regions, surface-
forming regions, substrate binding region, and high antigenic index regions.
Polypeptide fragments of SEQ ID NO:Y falling within conserved domains are
specifically contemplated by the present invention. Moreover, polynucleotide
fragments encoding these domains are also contemplated.

Other preferred fragments are biologically active fragments. Biologically active
fragments are those exhibiting activity similar, but not necessarily identical, to an
activity of the polypeptide of the present invention. The biological activity of the
fragments may include an improved desired activity, or a decreased undesirable activity.

Epitopes & Antibodies

In the present invention, "epitopes" refer to polypeptide fragments having
antigenic or immunogenic activity in an animal, especially in a human. A preferred
embodiment of the present invention relates to a polypeptide fragment comprising an
epitope, as well as the polynucleotide encoding this fragment. A region of a protein
molecule to which an antibody can bind is defined as an "antigenic epitope." In
contrast, an "immunogenic epitope" is defined as a part of a protein that elicits an
antibody response. (See, for instance, Geysen et al., Proc. Natl. Acad. Sci. USA
81:3998-4002 (1983).)

Fragments which function as epitopes may be produced by any conventional
means. (See, e.g., Houghten, R. A., Proc. Natl. Acad. Sci. USA 82:5131-5135
(1985) further described in U.S. Patent No. 4,631,211.)

In the present invention, antigenic epitopes preferably contain a sequence of at
least seven, more preferably at least nine, and most preferably between about 15 and
about 30 amino acids. Antigenic epitopes are useful to raise antibodies, including
monoclonal antibodies, that specifically bind the epitope. (See, for instance, Wilson et
al., Cell 37:767-778 (1984); Sutcliffe, J. G. et al., Science 219:660-666 (1983).)

Similarly, immunogenic epitopes can be used to induce antibodies according to
methods well known in the art. (See, for instance, Sutcliffe et al., supra; Wilson et al.,
supra; Chow, M. et al., Proc. Natl. Acad. Sci. USA 82:910-914; and Bittle, F. J. et
al., J. Gen. Virol. 66:2347-2354 (1985).) A preferred immunogenic epitope includes
the secreted protein. The immunogenic epitopes may be presented together with a
carrier protein, such as an albumin, to an animal system (such as rabbit or mouse) or, if
it is long enough (at least about 25 amino acids), without a carrier. However,
immunogenic epitopes comprising as few as 8 to 10 amino acids have been shown to be
sufficient to raise antibodies capable of binding to, at the very least, linear epitopes in a
denatured polypeptide (e.g., in Western blotting).

As used herein, the term "antibody" (Ab) or "monoclonal antibody" (Mab) is
meant to include intact molecules as well as antibody fragments (such as, for example,
Fab and F(ab')2 fragments) which are capable of specifically binding to protein. Fab
and F(ab')2 fragments lack the Fc fragment of intact antibody, clear more rapidly from
the circulation, and may have less non-specific tissue binding than an intact antibody.
(Wahl et al., J. Nucl. Med. 24:316-325 (1983).) Thus, these fragments are preferred,
as well as the products of a FAB or other immunoglobulin expression library.
Moreover, antibodies of the present invention include chimeric, single chain, and
humanized antibodies.

Fusion Proteins

Any polypeptide of the present invention can be used to generate fusion
proteins. For example, the polypeptide of the present invention, when fused to a
second protein, can be used as an antigenic tag. Antibodies raised against the
polypeptide of the present invention can be used to indirectly detect the second protein
by binding to the polypeptide. Moreover, because secreted proteins target cellular
locations based on trafficking signals, the polypeptides of the present invention can be
used as targeting molecules once fused to other proteins.

Examples of domains that can be fused to polypeptides of the present invention
include not only heterologous signal sequences, but also other heterologous functional
regions. The fusion does not necessarily need to be direct, but may occur through
linker sequences.

Moreover, fusion proteins may also be engineered to improve characteristics of
the polypeptide of the present invention. For instance, a region of additional amino
acids, particularly charged amino acids, may be added to the N-terminus of the
polypeptide to improve stability and persistence during purification from the host cell or
subsequent handling and storage. Also, peptide moieties may be added to the
polypeptide to facilitate purification. Such regions may be removed prior to final
preparation of the polypeptide. The addition of peptide moieties to facilitate handling of
polypeptides is a familiar and routine technique in the art.

Moreover, polypeptides of the present invention, including fragments, and
specifically epitopes, can be combined with parts of the constant domain of
immunoglobulins (IgG), resulting in chimeric polypeptides. These fusion proteins
facilitate purification and show an increased half-life in vivo. One reported example
describes chimeric proteins consisting of the first two domains of the human CD4-
polypeptide and various domains of the constant regions of the heavy or light chains of
mammalian immunoglobulins. (EP A 394,827; Traunecker et al., Nature 331:84-86
(1988).) Fusion proteins having disulfide-linked dimeric structures (due to the IgG)
can also be more efficient in binding and neutralizing other molecules than the
monomeric secreted protein or protein fragment alone. (Fountoulakis et al., J.
Biochem. 270:3958-3964 (1995).)

Similarly, EP-A-0 464 533 (Canadian counterpart 2045869) discloses fusion
proteins comprising various portions of constant region of immunoglobulin molecules
together with another human protein or part thereof. In many cases, the Fc part in a
fusion protein is beneficial in therapy and diagnosis, and thus can result in, for
example, improved pharmacokinetic properties. (EP-A 0232 262.) Alternatively,
deleting the Fc part after the fusion protein has been expressed, detected, and purified,
would be desired. For example, the Fc portion may hinder therapy and diagnosis if the
fusion protein is used as an antigen for immunizations. In drug discovery, for
example, human proteins, such as hIL-5, have been fused with Fc portions for the
purpose of high-throughput screening assays to identify antagonists of hIL-5. (See, D.
Bennett et al., J. Molecular Recognition 8:52-58 (1995); K. Johanson et al., J. Biol.
Chem. 270:9459-9471 (1995).)

Moreover, the polypeptides of the present invention can be fused to marker
sequences, such as a peptide which facilitates purification of the fused polypeptide. In
preferred embodiments, the marker amino acid sequence is a hexa-histidine peptide,
such as the tag provided in a pQE vector (QIAGEN, Inc., 9259 Eton Avenue,
Chatsworth, CA, 91311), among others, many of which are commercially available.
As described in Gentz et al., Proc. Natl. Acad. Sci. USA 86:821-824 (1989), for
instance, hexa-histidine provides for convenient purification of the fusion protein.
Another peptide tag useful for purification, the "HA" tag, corresponds to an epitope
derived from the influenza hemagglutinin protein. (Wilson et al., Cell 37:767 (1984).)

Thus, any of these above fusions can be engineered using the polynucleotides
or the polypeptides of the present invention.

Vectors, Host Cells, and Protein Production

The present invention also relates to vectors containing the polynucleotide of the
present invention, host cells, and the production of polypeptides by recombinant
techniques. The vector may be, for example, a phage, plasmid, viral, or retroviral
vector. Retroviral vectors may be replication competent or replication defective. In the
latter case, viral propagation generally will occur only in complementing host cells.

The polynucleotides may be joined to a vector containing a selectable marker for
propagation in a host. Generally, a plasmid vector is introduced in a precipitate, such
as a calcium phosphate precipitate, or in a complex with a charged lipid. If the vector is
a virus, it may be packaged in vitro using an appropriate packaging cell line and then
transduced into host cells.

The polynucleotide insert should be operatively linked to an appropriate 
promoter, such as the phage lambda PL promoter, the E. coli lac, tip. phoA and tac 
promoters, the SV40 early and late promoters and promoters of reu-oviral LTRs, to 
name a few. Other suitable promoters will be known to tiie skilled artisan. The 
expression constructs will further contain sites for transcription initiation, termination, 
and, in the transcribed region, a ribosome binding site for translation. The coding 
portion of the tianscripts expressed by the constiucts will preferably include a 



wo 99/46289 PCT/US99/05721 

71 



translation initiating codon at the beginning and a termination codon (U AA, UGA or 
UAG) appropriately positioned at the end of the polypeptide to be translated. 
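
The start and stop requirement just described can be checked mechanically. The following is a minimal Python sketch and not part of the original disclosure: it assumes the initiating codon is AUG (the text does not name it), and the function name `check_coding_region` is hypothetical. It tests that a coding region begins with a start codon, ends in frame with one of the three termination codons listed above, and contains no earlier in-frame stop.

```python
# Illustrative sketch only; all names here are hypothetical, not from the source.
START_CODON = "AUG"                  # assumed initiating codon
STOP_CODONS = {"UAA", "UGA", "UAG"}  # termination codons named in the text


def check_coding_region(seq: str) -> dict:
    """Report whether an mRNA (or cDNA) coding region is well framed."""
    rna = seq.upper().replace("T", "U")  # accept DNA or RNA letters
    in_frame = len(rna) % 3 == 0
    # Split into complete codons, ignoring any trailing partial codon.
    codons = [rna[i:i + 3] for i in range(0, len(rna) - len(rna) % 3, 3)]
    return {
        "starts_with_start": bool(codons) and codons[0] == START_CODON,
        "ends_with_stop": in_frame and bool(codons) and codons[-1] in STOP_CODONS,
        # Indices of stop codons that occur before the final codon.
        "premature_stops": [i for i, c in enumerate(codons[:-1]) if c in STOP_CODONS],
    }


if __name__ == "__main__":
    print(check_coding_region("ATGGCTTAA"))
```

A construct that fails any of the three checks would be expected to give a truncated or untranslated product, which is why the text asks for the codons to be "appropriately positioned".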

As indicated, the expression vectors will preferably include at least one 
selectable marker. Such markers include dihydrofolate reductase, G418 or neomycin 
resistance for eukaryotic cell culture and tetracycline, kanamycin or ampicillin resistance 
genes for culturing in E. coli and other bacteria. Representative examples of 
appropriate hosts include, but are not limited to, bacterial cells, such as E. coli, 
Streptomyces and Salmonella typhimurium cells; fungal cells, such as yeast cells; insect 
cells such as Drosophila S2 and Spodoptera Sf9 cells; animal cells such as CHO, COS, 
293, and Bowes melanoma cells; and plant cells. Appropriate culture media and 
conditions for the above-described host cells are known in the art. 

Vectors preferred for use in bacteria include pQE70, pQE60 and pQE-9, 
available from QIAGEN, Inc.; pBluescript vectors, Phagescript vectors, pNH8A, 
pNH16a, pNH18A, pNH46A, available from Stratagene Cloning Systems, Inc.; and 
ptrc99a, pKK223-3, pKK233-3, pDR540, pRIT5 available from Pharmacia Biotech, 
Inc. Among preferred eukaryotic vectors are pWLNEO, pSV2CAT, pOG44, pXT1 
and pSG available from Stratagene; and pSVK3, pBPV, pMSG and pSVL available 
from Pharmacia. Other suitable vectors will be readily apparent to the skilled artisan. 

Introduction of the construct into the host cell can be effected by calcium 
phosphate transfection, DEAE-dextran mediated transfection, cationic lipid-mediated 
transfection, electroporation, transduction, infection, or other methods. Such methods 
are described in many standard laboratory manuals, such as Davis et al., Basic Methods 
In Molecular Biology (1986). It is specifically contemplated that the polypeptides of the 
present invention may in fact be expressed by a host cell lacking a recombinant vector. 

A polypeptide of this invention can be recovered and purified from recombinant 
cell cultures by well-known methods including ammonium sulfate or ethanol 
precipitation, acid extraction, anion or cation exchange chromatography, 
phosphocellulose chromatography, hydrophobic interaction chromatography, affinity 
chromatography, hydroxylapatite chromatography and lectin chromatography. Most 
preferably, high performance liquid chromatography ("HPLC") is employed for 
purification. 

Polypeptides of the present invention, and preferably the secreted form, can also 
be recovered from: products purified from natural sources, including bodily fluids, 
tissues and cells, whether directly isolated or cultured; products of chemical synthetic 
procedures; and products produced by recombinant techniques from a prokaryotic or 
eukaryotic host, including, for example, bacterial, yeast, higher plant, insect, and 
mammalian cells. Depending upon the host employed in a recombinant production 
procedure, the polypeptides of the present invention may be glycosylated or may be 
non-glycosylated. In addition, polypeptides of the invention may also include an initial 
modified methionine residue, in some cases as a result of host-mediated processes. 
Thus, it is well known in the art that the N-terminal methionine encoded by the 
translation initiation codon generally is removed with high efficiency from any protein 
after translation in all eukaryotic cells. While the N-terminal methionine on most 
proteins also is efficiently removed in most prokaryotes, for some proteins, this 
prokaryotic removal process is inefficient, depending on the nature of the amino acid to 
which the N-terminal methionine is covalently linked. 

In addition to encompassing host cells containing the vector constructs 
discussed herein, the invention also encompasses primary, secondary, and 
immortalized host cells of vertebrate origin, particularly mammalian origin, that have 
been engineered to delete or replace endogenous genetic material (e.g., coding 
sequence), and/or to include genetic material (e.g., heterologous polynucleotide 
sequences) that is operably associated with the polynucleotides of the invention, and 
which activates, alters, and/or amplifies endogenous polynucleotides. For example, 
techniques known in the art may be used to operably associate heterologous control 
regions (e.g., promoter and/or enhancer) and endogenous polynucleotide sequences via 
homologous recombination (see, e.g., U.S. Patent No. 5,641,670, issued June 24, 
1997; International Publication No. WO 96/29411, published September 26, 1996; 
International Publication No. WO 94/12650, published August 4, 1994; Koller et al., 
Proc. Natl. Acad. Sci. USA 86:8932-8935 (1989); and Zijlstra et al., Nature 342:435-438 
(1989), the disclosures of each of which are incorporated by reference in their 
entireties). 



Uses of the Polynucleotides 

Each of the polynucleotides identified herein can be used in numerous ways as 
reagents. The following description should be considered exemplary and utilizes 
known techniques. 

The polynucleotides of the present invention are useful for chromosome 
identification. There exists an ongoing need to identify new chromosome markers, 
since few chromosome marking reagents, based on actual sequence data (repeat 
polymorphisms), are presently available. Each polynucleotide of the present invention 
can be used as a chromosome marker. 

Briefly, sequences can be mapped to chromosomes by preparing PCR primers 
(preferably 15-25 bp) from the sequences shown in SEQ ID NO:X. Primers can be 
selected using computer analysis so that primers do not span more than one predicted 
exon in the genomic DNA. These primers are then used for PCR screening of somatic 
cell hybrids containing individual human chromosomes. Only those hybrids containing 
the human gene corresponding to the SEQ ID NO:X will yield an amplified fragment. 

Similarly, somatic hybrids provide a rapid method of PCR mapping the 
polynucleotides to particular chromosomes. Three or more clones can be assigned per 
day using a single thermal cycler. Moreover, sublocalization of the polynucleotides can 
be achieved with panels of specific chromosome fragments. Other gene mapping 
strategies that can be used include in situ hybridization, prescreening with labeled 
flow-sorted chromosomes, and preselection by hybridization to construct 
chromosome-specific cDNA libraries. 
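
The computer-based primer selection mentioned above can be pictured as a simple interval check. The sketch below is an illustration under stated assumptions and not the method used in the original: predicted exon coordinates are taken as given, expressed as half-open (start, end) intervals on the cDNA, and the helper name `primer_in_single_exon` is hypothetical. It accepts a candidate primer only if its length falls in the preferred 15-25 bp range and it lies wholly inside one exon.

```python
# Illustrative sketch only; exon coordinates and names are hypothetical inputs.
from typing import List, Tuple

Exon = Tuple[int, int]  # half-open interval [start, end) on the cDNA


def primer_in_single_exon(start: int, length: int, exons: List[Exon]) -> bool:
    """Return True if a primer of the given length sits inside a single exon."""
    if not 15 <= length <= 25:
        raise ValueError("primer length outside the preferred 15-25 bp range")
    end = start + length
    # The primer must be fully contained in at least one predicted exon.
    return any(e_start <= start and end <= e_end for e_start, e_end in exons)


if __name__ == "__main__":
    predicted_exons = [(0, 100), (100, 250)]
    print(primer_in_single_exon(10, 20, predicted_exons))  # inside first exon
    print(primer_in_single_exon(90, 20, predicted_exons))  # crosses the junction
```

A primer that crosses an exon junction in the cDNA would be interrupted by an intron in genomic DNA, which is the reason for the constraint when screening somatic cell hybrids.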

Precise chromosomal location of the polynucleotides can also be achieved using 
fluorescence in situ hybridization (FISH) of a metaphase chromosomal spread. This 
technique uses polynucleotides as short as 500 or 600 bases; however, polynucleotides 
of 2,000-4,000 bp are preferred. For a review of this technique, see Verma et al., 
"Human Chromosomes: a Manual of Basic Techniques," Pergamon Press, New York 
(1988). 

For chromosome mapping, the polynucleotides can be used individually (to 
mark a single chromosome or a single site on that chromosome) or in panels (for 
marking multiple sites and/or multiple chromosomes). Preferred polynucleotides 
correspond to the noncoding regions of the cDNAs because the coding sequences are 
more likely conserved within gene families, thus increasing the chance of cross 
hybridization during chromosomal mapping. 

Once a polynucleotide has been mapped to a precise chromosomal location, the 
physical position of the polynucleotide can be used in linkage analysis. Linkage 
analysis establishes coinheritance between a chromosomal location and presentation of a 
particular disease. (Disease mapping data are found, for example, in V. McKusick, 
Mendelian Inheritance in Man (available on line through Johns Hopkins University 
Welch Medical Library).) Assuming 1 megabase mapping resolution and one gene per 
20 kb, a cDNA precisely localized to a chromosomal region associated with the disease 
could be one of 50-500 potential causative genes. 
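
The lower bound of that estimate follows directly from the stated assumptions: 1,000,000 bp divided by 20,000 bp per gene gives 50 genes. A short Python sketch of the arithmetic is given here. The function name is hypothetical, and the 10 megabase case is only an illustration of how a coarser mapping resolution would reach the upper figure; that reading is an inference, not something the text states.

```python
# Illustrative arithmetic only; the function name is hypothetical.
def candidate_genes(region_bp: int, bp_per_gene: int = 20_000) -> int:
    """Estimate how many genes fit in a mapped region at a uniform gene density."""
    if region_bp <= 0 or bp_per_gene <= 0:
        raise ValueError("region size and gene spacing must be positive")
    return region_bp // bp_per_gene


if __name__ == "__main__":
    print(candidate_genes(1_000_000))   # 1 Mb resolution, as stated in the text
    print(candidate_genes(10_000_000))  # assumed coarser 10 Mb resolution
```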

Thus, once coinheritance is established, differences in the polynucleotide and 
the corresponding gene between affected and unaffected individuals can be examined. 
First, visible structural alterations in the chromosomes, such as deletions or 
translocations, are examined in chromosome spreads or by PCR. If no structural 
alterations exist, the presence of point mutations is ascertained. Mutations observed in 
some or all affected individuals, but not in normal individuals, indicate that the 
mutation may cause the disease. However, complete sequencing of the polypeptide and 
the corresponding gene from several normal individuals is required to distinguish the 
mutation from a polymorphism. If a new polymorphism is identified, this polymorphic 
polypeptide can be used for further linkage analysis. 

Furthermore, increased or decreased expression of the gene in affected 
individuals as compared to unaffected individuals can be assessed using 
polynucleotides of the present invention. Any of these alterations (altered expression, 
chromosomal rearrangement, or mutation) can be used as a diagnostic or prognostic 
marker. 

In addition to the foregoing, a polynucleotide can be used to control gene 
expression through triple helix formation or antisense DNA or RNA. Both methods 
rely on binding of the polynucleotide to DNA or RNA. For these techniques, preferred 
polynucleotides are usually 20 to 40 bases in length and complementary to either the 
region of the gene involved in transcription (triple helix - see Lee et al., Nucl. Acids 
Res. 6:3073 (1979); Cooney et al., Science 241:456 (1988); and Dervan et al., Science 
251:1360 (1991)) or to the mRNA itself (antisense - Okano, J. Neurochem. 56:560 
(1991); Oligodeoxy-nucleotides as Antisense Inhibitors of Gene Expression, CRC 
Press, Boca Raton, FL (1988)). Triple helix formation optimally results in a shut-off 
of RNA transcription from DNA, while antisense RNA hybridization blocks translation 
of an mRNA molecule into polypeptide. Both techniques are effective in model 
systems, and the information disclosed herein can be used to design antisense or triple 
helix polynucleotides in an effort to treat disease. 
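
As a rough illustration of the antisense case, an oligonucleotide complementary to a stretch of mRNA is simply the reverse complement of that stretch. The Python sketch below assumes plain Watson-Crick pairing and enforces the 20 to 40 base length range given above; the function name, the example sequence, and the choice of target window are all hypothetical, and nothing here addresses target accessibility or specificity.

```python
# Illustrative sketch only; names and the example sequence are hypothetical.
_COMPLEMENT = str.maketrans("ACGU", "UGCA")  # Watson-Crick pairing for RNA


def antisense_oligo(mrna: str, start: int, length: int = 20, as_dna: bool = False) -> str:
    """Return the antisense oligo (5'->3') for mrna[start:start+length]."""
    if not 20 <= length <= 40:
        raise ValueError("length outside the 20-40 base range given in the text")
    rna = mrna.upper().replace("T", "U")
    target = rna[start:start + length]
    if start < 0 or len(target) != length:
        raise ValueError("target window falls outside the sequence")
    # Complement each base, then reverse so the result reads 5'->3'.
    oligo = target.translate(_COMPLEMENT)[::-1]
    return oligo.replace("U", "T") if as_dna else oligo


if __name__ == "__main__":
    example_mrna = "AUGGCCAAGUUCGAUCCGGAUACGUAGC"  # made-up sequence
    print(antisense_oligo(example_mrna, 0, 20))
    print(antisense_oligo(example_mrna, 0, 20, as_dna=True))
```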

Polynucleotides of the present invention are also useful in gene therapy. One 
goal of gene therapy is to insert a normal gene into an organism having a defective 
gene, in an effort to correct the genetic defect. The polynucleotides disclosed in the 
present invention offer a means of targeting such genetic defects in a highly accurate 
manner. Another goal is to insert a new gene that was not present in the host genome, 
thereby producing a new trait in the host cell. 

The polynucleotides are also useful for identifying individuals from minute 
biological samples. The United States military, for example, is considering the use of 
restriction fragment length polymorphism (RFLP) for identification of its personnel. In 
this technique, an individual's genomic DNA is digested with one or more restriction 
enzymes, and probed on a Southern blot to yield unique bands for identifying 
personnel. This method does not suffer from the current limitations of "Dog Tags" 
which can be lost, switched, or stolen, making positive identification difficult. The 
polynucleotides of the present invention can be used as additional DNA markers for 
RFLP. 
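
To make the RFLP idea concrete, the sketch below digests two made-up alleles in silico and compares the fragment lengths that would appear as bands. It is an illustration under simplifying assumptions: linear DNA, a single enzyme, top strand only, and no modeling of the Southern blot probe. EcoRI (recognition site GAATTC, cutting after the first G) is used purely as a familiar example; the function name and sequences are hypothetical.

```python
# Illustrative sketch only; sequences and names are hypothetical.
from typing import List


def digest(dna: str, site: str = "GAATTC", cut_offset: int = 1) -> List[int]:
    """Return fragment lengths (longest first) after cutting at every site."""
    dna = dna.upper()
    cuts = []
    pos = dna.find(site)
    while pos != -1:
        cuts.append(pos + cut_offset)  # position of the cut within the site
        pos = dna.find(site, pos + 1)
    bounds = [0] + cuts + [len(dna)]
    return sorted((end - start for start, end in zip(bounds, bounds[1:])), reverse=True)


if __name__ == "__main__":
    allele_a = "AAAA" + "GAATTC" + "CCCCCCCC" + "GAATTC" + "TTTT"
    # A single base change destroys the second site in allele B.
    allele_b = "AAAA" + "GAATTC" + "CCCCCCCC" + "GAATAC" + "TTTT"
    print(digest(allele_a))
    print(digest(allele_b))
```

The two alleles differ by one base yet give different fragment sets, which is the polymorphism that RFLP-based identification relies on.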

The polynucleotides of the present invention can also be used as an alternative to 
RFLP, by determining the actual base-by-base DNA sequence of selected portions of an 
individual's genome. These sequences can be used to prepare PCR primers for 
amplifying and isolating such selected DNA, which can then be sequenced. Using this 
technique, individuals can be identified because each individual will have a unique set 
of DNA sequences. Once a unique ID database is established for an individual, 
positive identification of that individual, living or dead, can be made from extremely 
small tissue samples. 

Forensic biology also benefits from using DNA-based identification techniques 
as disclosed herein. DNA sequences taken from very small biological samples such as 
tissues, e.g., hair or skin, or body fluids, e.g., blood, saliva, semen, etc., can be 
amplified using PCR. In one prior art technique, gene sequences amplified from 
polymorphic loci, such as DQa class II HLA gene, are used in forensic biology to 
identify individuals. (Erlich, H., PCR Technology, Freeman and Co. (1992).) Once 
these specific polymorphic loci are amplified, they are digested with one or more 
restriction enzymes, yielding an identifying set of bands on a Southern blot probed with 
DNA corresponding to the DQa class II HLA gene. Similarly, polynucleotides of the 
present invention can be used as polymorphic markers for forensic purposes. 

There is also a need for reagents capable of identifying the source of a particular 
tissue. Such need arises, for example, in forensics when presented with tissue of 
unknown origin. Appropriate reagents can comprise, for example, DNA probes or 
primers specific to particular tissue prepared from the sequences of the present 
invention. Panels of such reagents can identify tissue by species and/or by organ type. 
In a similar fashion, these reagents can be used to screen tissue cultures for 
contamination. 

At the very least, the polynucleotides of the present invention can be used as 
molecular weight markers on Southern gels, as diagnostic probes for the presence of a 
specific mRNA in a particular cell type, as a probe to "subtract-out" known sequences 
in the process of discovering novel polynucleotides, for selecting and making oligomers 
for attachment to a "gene chip" or other support, to raise anti-DNA antibodies using 
DNA immunization techniques, and as an antigen to elicit an immune response. 

Uses of the Polypeptides 

Each of the polypeptides identified herein can be used in numerous ways. The 
following description should be considered exemplary and utilizes known techniques. 

A polypeptide of the present invention can be used to assay protein levels in a 
biological sample using antibody-based techniques. For example, protein expression in 
tissues can be studied with classical immunohistological methods. (Jalkanen, M., et 
al., J. Cell. Biol. 101:976-985 (1985); Jalkanen, M., et al., J. Cell. Biol. 105:3087-3096 
(1987).) Other antibody-based methods useful for detecting protein gene 
expression include immunoassays, such as the enzyme linked immunosorbent assay 
(ELISA) and the radioimmunoassay (RIA). Suitable antibody assay labels are known 
in the art and include enzyme labels, such as glucose oxidase, and radioisotopes, such 
as iodine (125I, 121I), carbon (14C), sulfur (35S), tritium (3H), indium (112In), and 
technetium (99mTc), and fluorescent labels, such as fluorescein and rhodamine, and 
biotin. 

In addition to assaying secreted protein levels in a biological sample, proteins 
can also be detected in vivo by imaging. Antibody labels or markers for in vivo 
imaging of protein include those detectable by X-radiography, NMR or ESR. For 
X-radiography, suitable labels include radioisotopes such as barium or cesium, which emit 
detectable radiation but are not overtly harmful to the subject. Suitable markers for 
NMR and ESR include those with a detectable characteristic spin, such as deuterium, 
which may be incorporated into the antibody by labeling of nutrients for the relevant 
hybridoma. 

A protein-specific antibody or antibody fragment which has been labeled with 
an appropriate detectable imaging moiety, such as a radioisotope (for example, 131I, 
112In, 99mTc), a radio-opaque substance, or a material detectable by nuclear magnetic 
resonance, is introduced (for example, parenterally, subcutaneously, or 
intraperitoneally) into the mammal. It will be understood in the art that the size of the 
subject and the imaging system used will determine the quantity of imaging moiety 
needed to produce diagnostic images. In the case of a radioisotope moiety, for a human 
subject, the quantity of radioactivity injected will normally range from about 5 to 20 
millicuries of 99mTc. The labeled antibody or antibody fragment will then 
preferentially accumulate at the location of cells which contain the specific protein. In 
vivo tumor imaging is described in S.W. Burchiel et al., "Immunopharmacokinetics of 
Radiolabeled Antibodies and Their Fragments." (Chapter 13 in Tumor Imaging: The 
Radiochemical Detection of Cancer, S.W. Burchiel and B. A. Rhodes, eds., Masson 
Publishing Inc. (1982).) 

Thus, the invention provides a method of diagnosing a disorder, which involves 
(a) assaying the expression of a polypeptide of the present invention in cells or body 
fluid of an individual; and (b) comparing the level of gene expression with a standard gene 
expression level, whereby an increase or decrease in the assayed polypeptide gene 
expression level compared to the standard expression level is indicative of a disorder. 
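
Step (b) amounts to a ratio test against a reference value. The Python sketch below shows one way such a comparison could be expressed. It is not drawn from the original: the function name is hypothetical, and the 20% tolerance band is an arbitrary placeholder, since the text does not say how large a change counts as an increase or decrease.

```python
# Illustrative sketch only; the tolerance value is an arbitrary assumption.
def compare_to_standard(assayed: float, standard: float, tolerance: float = 0.2) -> str:
    """Classify an assayed expression level relative to a standard level."""
    if standard <= 0:
        raise ValueError("standard expression level must be positive")
    ratio = assayed / standard
    if ratio > 1 + tolerance:
        return "increased"
    if ratio < 1 - tolerance:
        return "decreased"
    return "within standard range"


if __name__ == "__main__":
    for level in (150.0, 70.0, 110.0):
        print(level, compare_to_standard(level, standard=100.0))
```

In practice the standard level and the acceptable band would have to come from measurements on unaffected individuals using the same assay.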

Moreover, polypeptides of the present invention can be used to treat disease. 
For example, patients can be administered a polypeptide of the present invention in an 
effort to replace absent or decreased levels of the polypeptide (e.g., insulin), to 
supplement absent or decreased levels of a different polypeptide (e.g., hemoglobin S 
for hemoglobin B), to inhibit the activity of a polypeptide (e.g., an oncogene), to 
activate the activity of a polypeptide (e.g., by binding to a receptor), to reduce the 
activity of a membrane bound receptor by competing with it for free ligand (e.g., 
soluble TNF receptors used in reducing inflammation), or to bring about a desired 
response (e.g., blood vessel growth). 

Similarly, antibodies directed to a polypeptide of the present invention can also 
be used to treat disease. For example, administration of an antibody directed to a 
polypeptide of the present invention can bind and reduce overproduction of the 
polypeptide. Similarly, administration of an antibody can activate the polypeptide, such 
as by binding to a polypeptide bound to a membrane (receptor). 

At the very least, the polypeptides of the present invention can be used as 
molecular weight markers on SDS-PAGE gels or on molecular sieve gel filtration 
columns using methods well known to those of skill in the art. Polypeptides can also 
be used to raise antibodies, which in turn are used to measure protein expression from a 
recombinant cell, as a way of assessing transformation of the host cell. Moreover, the 
polypeptides of the present invention can be used to test the following biological 
activities. 

Biological Activities 



The polynucleotides and polypeptides of the present invention can be used in 
assays to test for one or more biological activities. If these polynucleotides and 
polypeptides do exhibit activity in a particular assay, it is likely that these molecules 
may be involved in the diseases associated with the biological activity. Thus, the 
polynucleotides and polypeptides could be used to treat the associated disease. 

Immune Activity 

A polypeptide or polynucleotide of the present invention may be useful in 
treating deficiencies or disorders of the immune system, by activating or inhibiting the 
proliferation, differentiation, or mobilization (chemotaxis) of immune cells. Immune 
cells develop through a process called hematopoiesis, producing myeloid (platelets, red 
blood cells, neutrophils, and macrophages) and lymphoid (B and T lymphocytes) cells 
from pluripotent stem cells. The etiology of these immune deficiencies or disorders 
may be genetic, somatic, such as cancer or some autoimmune disorders, acquired (e.g., 
by chemotherapy or toxins), or infectious. Moreover, a polynucleotide or polypeptide 
of the present invention can be used as a marker or detector of a particular immune 
system disease or disorder. 

A polynucleotide or polypeptide of the present invention may be useful in 
treating or detecting deficiencies or disorders of hematopoietic cells. A polypeptide or 
polynucleotide of the present invention could be used to increase differentiation and 
proliferation of hematopoietic cells, including the pluripotent stem cells, in an effort to 
treat those disorders associated with a decrease in certain (or many) types of hematopoietic 
cells. Examples of immunologic deficiency syndromes include, but are not limited to: 
blood protein disorders (e.g., agammaglobulinemia, dysgammaglobulinemia), ataxia 
telangiectasia, common variable immunodeficiency, DiGeorge Syndrome, HIV 
infection, HTLV-BLV infection, leukocyte adhesion deficiency syndrome, 
lymphopenia, phagocyte bactericidal dysfunction, severe combined immunodeficiency 
(SCIDs), Wiskott-Aldrich Disorder, anemia, thrombocytopenia, or hemoglobinuria. 

Moreover, a polypeptide or polynucleotide of the present invention could also 
be used to modulate hemostatic (the stopping of bleeding) or thrombolytic activity (clot 
formation). For example, by increasing hemostatic or thrombolytic activity, a 
polynucleotide or polypeptide of the present invention could be used to treat blood 
coagulation disorders (e.g., afibrinogenemia, factor deficiencies), blood platelet 
disorders (e.g., thrombocytopenia), or wounds resulting from trauma, surgery, or other 
causes. Alternatively, a polynucleotide or polypeptide of the present invention that can 
decrease hemostatic or thrombolytic activity could be used to inhibit or dissolve 
clotting. These molecules could be important in the treatment of heart attacks 
(infarction), strokes, or scarring. 

A polynucleotide or polypeptide of the present invention may also be useful in 
treating or detecting autoimmune disorders. Many autoimmune disorders result from 
inappropriate recognition of self as foreign material by immune cells. This 
inappropriate recognition results in an immune response leading to the destruction of the 
host tissue. Therefore, the administration of a polypeptide or polynucleotide of the 
present invention that inhibits an immune response, particularly the proliferation, 
differentiation, or chemotaxis of T-cells, may be an effective therapy in preventing 
autoimmune disorders. 

Examples of autoimmune disorders that can be treated or detected by the present 
invention include, but are not limited to: Addison's Disease, hemolytic anemia, 
antiphospholipid syndrome, rheumatoid arthritis, dermatitis, allergic encephalomyelitis, 
glomerulonephritis, Goodpasture's Syndrome, Graves' Disease, Multiple Sclerosis, 
Myasthenia Gravis, Neuritis, Ophthalmia, Bullous Pemphigoid, Pemphigus, 
Polyendocrinopathies, Purpura, Reiter's Disease, Stiff-Man Syndrome, Autoimmune 
Thyroiditis, Systemic Lupus Erythematosus, Autoimmune Pulmonary Inflammation, 
Guillain-Barre Syndrome, insulin dependent diabetes mellitus, and autoimmune 
inflammatory eye disease. 

Similarly, allergic reactions and conditions, such as asthma (particularly allergic 
asthma) or other respiratory problems, may also be treated by a polypeptide or 
polynucleotide of the present invention. Moreover, these molecules can be used to treat 
anaphylaxis, hypersensitivity to an antigenic molecule, or blood group incompatibility. 

A polynucleotide or polypeptide of the present invention may also be used to 
treat and/or prevent organ rejection or graft-versus-host disease (GVHD). Organ 
rejection occurs by host immune cell destruction of the transplanted tissue through an 
immune response. Similarly, an immune response is also involved in GVHD, but, in 
this case, the foreign transplanted immune cells destroy the host tissues. The 
administration of a polypeptide or polynucleotide of the present invention that inhibits 
an immune response, particularly the proliferation, differentiation, or chemotaxis of 
T-cells, may be an effective therapy in preventing organ rejection or GVHD. 

Similarly, a polypeptide or polynucleotide of the present invention may also be 
used to modulate inflammation. For example, the polypeptide or polynucleotide may 
inhibit the proliferation and differentiation of cells involved in an inflammatory 
response. These molecules can be used to treat inflammatory conditions, both chronic 
and acute conditions, including inflammation associated with infection (e.g., septic 
shock, sepsis, or systemic inflammatory response syndrome (SIRS)), 
ischemia-reperfusion injury, endotoxin lethality, arthritis, complement-mediated hyperacute 
rejection, nephritis, cytokine or chemokine induced lung injury, inflammatory bowel 
disease, Crohn's disease, or resulting from over production of cytokines (e.g., TNF or 
IL-1). 

Hyperproliferative Disorders 

A polypeptide or polynucleotide can be used to treat or detect hyperproliferative 
disorders, including neoplasms. A polypeptide or polynucleotide of the present 
invention may inhibit the proliferation of the disorder through direct or indirect 
interactions. Alternatively, a polypeptide or polynucleotide of the present invention 
may proliferate other cells which can inhibit the hyperproliferative disorder. 

For example, by increasing an immune response, particularly increasing 
antigenic qualities of the hyperproliferative disorder or by proliferating, differentiating, 
or mobilizing T-cells, hyperproliferative disorders can be treated. This immune 
response may be increased by either enhancing an existing immune response, or by 
initiating a new immune response. Alternatively, decreasing an immune response may 
also be a method of treating hyperproliferative disorders, such as with a chemotherapeutic 
agent. 

Examples of hyperproliferative disorders that can be treated or detected by a 
polynucleotide or polypeptide of the present invention include, but are not limited to 
neoplasms located in the: abdomen, bone, breast, digestive system, liver, pancreas, 
peritoneum, endocrine glands (adrenal, parathyroid, pituitary, testicles, ovary, thymus, 
thyroid), eye, head and neck, nervous (central and peripheral), lymphatic system, 
pelvic, skin, soft tissue, spleen, thoracic, and urogenital. 

Similarly, other hyperproliferative disorders can also be treated or detected by a 
polynucleotide or polypeptide of the present invention. Examples of such 
hyperproliferative disorders include, but are not limited to: hypergammaglobulinemia, 
lymphoproliferative disorders, paraproteinemias, purpura, sarcoidosis, Sezary 
Syndrome, Waldenstrom's Macroglobulinemia, Gaucher's Disease, histiocytosis, and 
any other hyperproliferative disease, besides neoplasia, located in an organ system 
listed above. 

Infectious Disease 

A polypeptide or polynucleotide of the present invention can be used to treat or 
detect infectious agents. For example, by increasing the immune response, particularly 
increasing the proliferation and differentiation of B and/or T cells, infectious diseases 
may be treated. The immune response may be increased by either enhancing an existing 
immune response, or by initiating a new immune response. Alternatively, the 
polypeptide or polynucleotide of the present invention may also directly inhibit the 
infectious agent, without necessarily eliciting an immune response. 

Viruses are one example of an infectious agent that can cause disease or 
symptoms that can be treated or detected by a polynucleotide or polypeptide of the 
present invention. Examples of viruses include, but are not limited to, the following 
DNA and RNA viral families: Arbovirus, Adenoviridae, Arenaviridae, Arterivirus, 
Birnaviridae, Bunyaviridae, Caliciviridae, Circoviridae, Coronaviridae, Flaviviridae, 
Hepadnaviridae (Hepatitis), Herpesviridae (such as Cytomegalovirus, Herpes 
Simplex, Herpes Zoster), Mononegavirus (e.g., Paramyxoviridae, Morbillivirus, 
Rhabdoviridae), Orthomyxoviridae (e.g., Influenza), Papovaviridae, Parvoviridae, 
Picornaviridae, Poxviridae (such as Smallpox or Vaccinia), Reoviridae (e.g., 
Rotavirus), Retroviridae (HTLV-I, HTLV-II, Lentivirus), and Togaviridae (e.g., 
Rubivirus). Viruses falling within these families can cause a variety of diseases or 
symptoms, including, but not limited to: arthritis, bronchiolitis, encephalitis, eye 
infections (e.g., conjunctivitis, keratitis), chronic fatigue syndrome, hepatitis (A, B, C, 
E, Chronic Active, Delta), meningitis, opportunistic infections (e.g., AIDS), 
pneumonia, Burkitt's Lymphoma, chickenpox, hemorrhagic fever, Measles, Mumps, 
Parainfluenza, Rabies, the common cold, Polio, leukemia, Rubella, sexually 
transmitted diseases, skin diseases (e.g., Kaposi's, warts), and viremia. A polypeptide 
or polynucleotide of the present invention can be used to treat or detect any of these 
symptoms or diseases. 

Similarly, bacterial or fungal agents that can cause disease or symptoms and that 
can be treated or detected by a polynucleotide or polypeptide of the present invention 
include, but are not limited to, the following Gram-negative and Gram-positive bacterial 
families and fungi: Actinomycetales (e.g., Corynebacterium, Mycobacterium, 
Nocardia), Aspergillosis, Bacillaceae (e.g., Anthrax, Clostridium), Bacteroidaceae, 
Blastomycosis, Bordetella, Borrelia, Brucellosis, Candidiasis, Campylobacter, 
Coccidioidomycosis, Cryptococcosis, Dermatomycoses, Enterobacteriaceae (Klebsiella, 
Salmonella, Serratia, Yersinia), Erysipelothrix, Helicobacter, Legionellosis, 
Leptospirosis, Listeria, Mycoplasmatales, Neisseriaceae (e.g., Acinetobacter, 
Gonorrhea, Meningococcal), Pasteurellaceae Infections (e.g., Actinobacillus, 
Haemophilus, Pasteurella), Pseudomonas, Rickettsiaceae, Chlamydiaceae, Syphilis, 
and Staphylococcal. These bacterial or fungal families can cause the following diseases 
or symptoms, including, but not limited to: bacteremia, endocarditis, eye infections 
(conjunctivitis, tuberculosis, uveitis), gingivitis, opportunistic infections (e.g., AIDS 
related infections), paronychia, prosthesis-related infections, Reiter's Disease, 
respiratory tract infections, such as Whooping Cough or Empyema, sepsis, Lyme 
Disease, Cat-Scratch Disease, Dysentery, Paratyphoid Fever, food poisoning, 
Typhoid, pneumonia, Gonorrhea, meningitis, Chlamydia, Syphilis, Diphtheria, 
Leprosy, Paratuberculosis, Tuberculosis, Lupus, Botulism, gangrene, tetanus, 
impetigo, Rheumatic Fever, Scarlet Fever, sexually transmitted diseases, skin diseases 
(e.g., cellulitis, dermatomycoses), toxemia, urinary tract infections, wound infections. 
A polypeptide or polynucleotide of the present invention can be used to treat or detect 
any of these symptoms or diseases. 

Moreover, parasitic agents causing disease or symptoms that can be treated or 
detected by a polynucleotide or polypeptide of the present invention include, but are not 
limited to, the following families: Amebiasis, Babesiosis, Coccidiosis, 
Cryptosporidiosis, Dientamoebiasis, Dourine, Ectoparasitic, Giardiasis, Helminthiasis, 
Leishmaniasis, Theileriasis, Toxoplasmosis, Trypanosomiasis, and Trichomonas. 
These parasites can cause a variety of diseases or symptoms, including, but not limited 
to: Scabies, Trombiculiasis, eye infections, intestinal disease (e.g., dysentery, 
giardiasis), liver disease, lung disease, opportunistic infections (e.g., AIDS related), 
Malaria, pregnancy complications, and toxoplasmosis. A polypeptide or polynucleotide 
of the present invention can be used to treat or detect any of these symptoms or 
diseases. 

Preferably, treatment using a polypeptide or polynucleotide of the present 
invention could either be by administering an effective amount of a polypeptide to the 
patient, or by removing cells from the patient, supplying the cells with a polynucleotide
of the present invention, and returning the engineered cells to the patient (ex vivo 
therapy). Moreover, the polypeptide or polynucleotide of the present invention can be 
used as an antigen in a vaccine to raise an immune response against infectious disease. 



Regeneration

A polynucleotide or polypeptide of the present invention can be used to 
differentiate, proliferate, and attract cells, leading to the regeneration of tissues. (See,
Science 276:59-87 (1997).) The regeneration of tissues could be used to repair,
replace, or protect tissue damaged by congenital defects, trauma (wounds, burns,
incisions, or ulcers), age, disease (e.g., osteoporosis, osteoarthritis, periodontal
disease, liver failure), surgery, including cosmetic plastic surgery, fibrosis, reperfusion 
injury, or systemic cytokine damage. 

Tissues that could be regenerated using the present invention include organs 
(e.g., pancreas, liver, intestine, kidney, skin, endothelium), muscle (smooth, skeletal 
or cardiac), vasculature (including vascular and lymphatics), nervous, hematopoietic, 
and skeletal (bone, cartilage, tendon, and ligament) tissue. Preferably, regeneration
occurs without scarring or with decreased scarring. Regeneration also may include angiogenesis.

Moreover, a polynucleotide or polypeptide of the present invention may increase 
regeneration of tissues difficult to heal. For example, increased tendon/ligament 
regeneration would quicken recovery time after damage. A polynucleotide or 
polypeptide of the present invention could also be used prophylactically in an effort to 
avoid damage. Specific diseases that could be treated include tendinitis, carpal tunnel
syndrome, and other tendon or ligament defects. Further examples of tissue
regeneration include the healing of non-healing wounds, such as pressure ulcers, ulcers
associated with vascular insufficiency, and surgical and traumatic wounds.

Similarly, nerve and brain tissue could also be regenerated by using a 
polynucleotide or polypeptide of the present invention to proliferate and differentiate 
nerve cells. Diseases that could be treated using this method include central and
peripheral nervous system diseases, neuropathies, or mechanical and traumatic
disorders (e.g., spinal cord disorders, head trauma, cerebrovascular disease, and
stroke). Specifically, diseases associated with peripheral nerve injuries, peripheral
neuropathy (e.g., resulting from chemotherapy or other medical therapies), localized
neuropathies, and central nervous system diseases (e.g., Alzheimer's disease,
Parkinson's disease, Huntington's disease, amyotrophic lateral sclerosis, and
Shy-Drager syndrome), could all be treated using the polynucleotide or polypeptide of the
present invention.

Chemotaxis 

A polynucleotide or polypeptide of the present invention may have chemotaxis
activity. A chemotactic molecule attracts or mobilizes cells (e.g., monocytes,
fibroblasts, neutrophils, T-cells, mast cells, eosinophils, epithelial and/or endothelial
cells) to a particular site in the body, such as a site of inflammation, infection, or
hyperproliferation. The mobilized cells can then fight off and/or heal the particular
trauma or abnormality.

A polynucleotide or polypeptide of the present invention may increase
chemotactic activity of particular cells. These chemotactic molecules can then be used to
treat inflammation, infection, hyperproliferative disorders, or any immune system
disorder by increasing the number of cells targeted to a particular location in the body.
For example, chemotactic molecules can be used to treat wounds and other trauma to
tissues by attracting immune cells to the injured location. Chemotactic molecules of the
present invention can also attract fibroblasts, which can be used to treat wounds.

It is also contemplated that a polynucleotide or polypeptide of the present
invention may inhibit chemotactic activity. These molecules could also be used to treat
disorders. Thus, a polynucleotide or polypeptide of the present invention could be used
as an inhibitor of chemotaxis.



Binding Activity 

A polypeptide of the present invention may be used to screen for molecules that
bind to the polypeptide or for molecules to which the polypeptide binds. The binding
of the polypeptide and the molecule may activate (agonist), increase, inhibit
(antagonist), or decrease activity of the polypeptide or the molecule bound. Examples
of such molecules include antibodies, oligonucleotides, proteins (e.g., receptors), or
small molecules.

Preferably, the molecule is closely related to the natural ligand of the
polypeptide, e.g., a fragment of the ligand, or a natural substrate, a ligand, a structural
or functional mimetic. (See, Coligan et al., Current Protocols in Immunology
1(2):Chapter 5 (1991).) Similarly, the molecule can be closely related to the natural
receptor to which the polypeptide binds, or at least, a fragment of the receptor capable
of being bound by the polypeptide (e.g., active site). In either case, the molecule can
be rationally designed using known techniques.

Preferably, the screening for these molecules involves producing appropriate
cells which express the polypeptide, either as a secreted protein or on the cell
membrane. Preferred cells include cells from mammals, yeast, Drosophila, or E. coli.
Cells expressing the polypeptide (or cell membrane containing the expressed
polypeptide) are then preferably contacted with a test compound potentially containing
the molecule to observe binding, stimulation, or inhibition of activity of either the
polypeptide or the molecule.

The assay may simply test binding of a candidate compound to the polypeptide, 
wherein binding is detected by a label, or in an assay involving competition with a 
labeled competitor. Further, the assay may test whether the candidate compound results 
in a signal generated by binding to the polypeptide. 

Alternatively, the assay can be carried out using cell-free preparations,
polypeptide/molecule affixed to a solid support, chemical libraries, or natural product
mixtures. The assay may also simply comprise the steps of mixing a candidate
compound with a solution containing a polypeptide, measuring polypeptide/molecule
activity or binding, and comparing the polypeptide/molecule activity or binding to a
standard.
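
The final step of this assay, comparing a measured activity or binding signal to a standard, can be sketched in Python as follows. This is only a minimal illustration: the function names, the example readings, and the 50% cutoff are hypothetical and are not taken from this specification.

```python
def percent_of_standard(sample_signal: float, standard_signal: float) -> float:
    """Return the sample binding/activity signal as a percentage of the standard."""
    if standard_signal <= 0:
        raise ValueError("standard signal must be positive")
    return 100.0 * sample_signal / standard_signal


def classify_candidate(sample_signal: float, standard_signal: float,
                       threshold_pct: float = 50.0) -> str:
    """Label a candidate compound by how far it shifts the signal from the standard.

    threshold_pct is an arbitrary illustrative cutoff, not a value from the text.
    """
    pct = percent_of_standard(sample_signal, standard_signal)
    if pct <= 100.0 - threshold_pct:
        return "inhibits"
    if pct >= 100.0 + threshold_pct:
        return "enhances"
    return "no clear effect"


# Hypothetical readings: the standard gives 100 units, the candidate well gives 25.
print(classify_candidate(25.0, 100.0))
```

In practice the cutoff would be set from replicate measurements of the standard rather than fixed in advance.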

Preferably, an ELISA assay can measure polypeptide level or activity in a 
sample (e.g., biological sample) using a monoclonal or polyclonal antibody. The 
antibody can measure polypeptide level or activity by either binding, directly or 
indirectly, to the polypeptide or by competing with the polypeptide for a substrate. 

All of these above assays can be used as diagnostic or prognostic markers. The 
molecules discovered using these assays can be used to treat disease or to bring about a 
particular result in a patient (e.g., blood vessel growth) by activating or inhibiting the 
polypeptide/molecule. Moreover, the assays can discover agents which may inhibit or 
enhance the production of the polypeptide from suitably manipulated cells or tissues. 

Therefore, the invention includes a method of identifying compounds which
bind to a polypeptide of the invention comprising the steps of: (a) incubating a
candidate binding compound with a polypeptide of the invention; and (b) determining if
binding has occurred. Moreover, the invention includes a method of identifying
agonists/antagonists comprising the steps of: (a) incubating a candidate compound with
a polypeptide of the invention, (b) assaying a biological activity, and (c) determining if
a biological activity of the polypeptide has been altered.

Other Activities

A polypeptide or polynucleotide of the present invention may also increase or
decrease the differentiation or proliferation of embryonic stem cells, besides, as
discussed above, hematopoietic lineage.

A polypeptide or polynucleotide of the present invention may also be used to 
modulate mammalian characteristics, such as body height, weight, hair color, eye color, 
skin, percentage of adipose tissue, pigmentation, size, and shape (e.g., cosmetic 
surgery). Similarly, a polypeptide or polynucleotide of the present invention may be
used to modulate mammalian metabolism affecting catabolism, anabolism, processing,
utilization, and storage of energy.

A polypeptide or polynucleotide of the present invention may be used to change 
a mammal's mental state or physical state by influencing biorhythms, circadian
rhythms, depression (including depressive disorders), tendency for violence, tolerance
for pain, reproductive capabilities (preferably by Activin or Inhibin-like activity),
hormonal or endocrine levels, appetite, libido, memory, stress, or other cognitive
qualities.

A polypeptide or polynucleotide of the present invention may also be used as a 
food additive or preservative, such as to increase or decrease storage capabilities, fat 
content, lipid, protein, carbohydrate, vitamins, minerals, cofactors or other nutritional
components. 

Other Preferred Embodiments 

Other preferred embodiments of the claimed invention include an isolated 
nucleic acid molecule comprising a nucleotide sequence which is at least 95% identical
to a sequence of at least about 50 contiguous nucleotides in the nucleotide sequence of 
SEQ ID NO:X wherein X is any integer as defined in Table 1. 
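
As a rough illustration of the "at least 95% identical over at least about 50 contiguous nucleotides" criterion, the Python sketch below scans ungapped windows of a query against a reference. It is a simplified assumption-laden example: it ignores gaps, does not model the tolerance implied by "about", and uses made-up sequences. The function names are hypothetical, and a real comparison would normally rely on a sequence alignment program.

```python
def percent_identity(a: str, b: str) -> float:
    """Percent of matching positions between two equal-length sequences."""
    if len(a) != len(b) or not a:
        raise ValueError("sequences must be non-empty and of equal length")
    matches = sum(1 for x, y in zip(a.upper(), b.upper()) if x == y)
    return 100.0 * matches / len(a)


def has_identical_window(query: str, reference: str,
                         window: int = 50, min_identity: float = 95.0) -> bool:
    """True if some `window`-long stretch of `reference` is matched at
    >= `min_identity` percent by an equally long ungapped stretch of `query`."""
    for i in range(len(reference) - window + 1):
        ref_win = reference[i:i + window]
        for j in range(len(query) - window + 1):
            if percent_identity(query[j:j + window], ref_win) >= min_identity:
                return True
    return False


# Made-up 80-nucleotide reference and a query carrying a single substitution.
reference = "ACGTTGCA" * 10
query = reference[:10] + "A" + reference[11:]
print(has_identical_window(query, reference))
```

The brute-force scan is quadratic in sequence length, which is acceptable for a sketch but not for database-scale searches.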

Also preferred is a nucleic acid molecule wherein said sequence of contiguous 
nucleotides is included in the nucleotide sequence of SEQ ID NO:X in the range of 
positions beginning with the nucleotide at about the position of the 5' Nucleotide of the
Clone Sequence and ending with the nucleotide at about the position of the 3' 
Nucleotide of the Clone Sequence as defined for SEQ ID NO:X in Table 1. 

Also preferred is a nucleic acid molecule wherein said sequence of contiguous 
nucleotides is included in the nucleotide sequence of SEQ ID NO:X in the range of 
positions beginning with the nucleotide at about the position of the 5' Nucleotide of the
Start Codon and ending with the nucleotide at about the position of the 3' Nucleotide of
the Clone Sequence as defined for SEQ ID NO:X in Table 1. 

Similarly preferred is a nucleic acid molecule wherein said sequence of
contiguous nucleotides is included in the nucleotide sequence of SEQ ID NO:X in the
range of positions beginning with the nucleotide at about the position of the 5'
Nucleotide of the First Amino Acid of the Signal Peptide and ending with the nucleotide
at about the position of the 3' Nucleotide of the Clone Sequence as defined for SEQ ID
NO:X in Table 1.

Also preferred is an isolated nucleic acid molecule comprising a nucleotide 
sequence which is at least 95% identical to a sequence of at least about 150 contiguous
nucleotides in the nucleotide sequence of SEQ ID NO:X. 

Further preferred is an isolated nucleic acid molecule comprising a nucleotide 
sequence which is at least 95% identical to a sequence of at least about 500 contiguous 
nucleotides in the nucleotide sequence of SEQ ID NO:X. 

A further preferred embodiment is a nucleic acid molecule comprising a
nucleotide sequence which is at least 95% identical to the nucleotide sequence of SEQ
ID NO:X beginning with the nucleotide at about the position of the 5' Nucleotide of the
First Amino Acid of the Signal Peptide and ending with the nucleotide at about the
position of the 3' Nucleotide of the Clone Sequence as defined for SEQ ID NO:X in
Table 1.

A further preferred embodiment is an isolated nucleic acid molecule comprising 
a nucleotide sequence which is at least 95% identical to the complete nucleotide 
sequence of SEQ ID NO:X. 

Also preferred is an isolated nucleic acid molecule which hybridizes under 
stringent hybridization conditions to a nucleic acid molecule, wherein said nucleic acid
molecule which hybridizes does not hybridize under stringent hybridization conditions 
to a nucleic acid molecule having a nucleotide sequence consisting of only A residues or 
of only T residues. 

Also preferred is a composition of matter comprising a DNA molecule which 
comprises a human cDNA clone identified by a cDNA Clone Identifier in Table 1,
which DNA molecule is contained in the material deposited with the American Type 
Culture Collection and given the ATCC Deposit Number shown in Table 1 for said 
cDNA Clone Identifier. 

Also preferred is an isolated nucleic acid molecule comprising a nucleotide 
sequence which is at least 95% identical to a sequence of at least 50 contiguous
nucleotides in the nucleotide sequence of a human cDNA clone identified by a cDNA
Clone Identifier in Table 1, which DNA molecule is contained in the deposit given the
ATCC Deposit Number shown in Table 1.

Also preferred is an isolated nucleic acid molecule, wherein said sequence of at 
least 50 contiguous nucleotides is included in the nucleotide sequence of the complete 
open reading frame sequence encoded by said human cDNA clone.

Also preferred is an isolated nucleic acid molecule comprising a nucleotide 
sequence which is at least 95% identical to a sequence of at least 150 contiguous
nucleotides in the nucleotide sequence encoded by said human cDNA clone. 

A further preferred embodiment is an isolated nucleic acid molecule comprising 
a nucleotide sequence which is at least 95% identical to a sequence of at least 500
contiguous nucleotides in the nucleotide sequence encoded by said human cDNA clone.

A further preferred embodiment is an isolated nucleic acid molecule comprising 
a nucleotide sequence which is at least 95% identical to the complete nucleotide 
sequence encoded by said human cDNA clone. 

A further preferred embodiment is a method for detecting in a biological sample
a nucleic acid molecule comprising a nucleotide sequence which is at least 95% identical
to a sequence of at least 50 contiguous nucleotides in a sequence selected from the
group consisting of: a nucleotide sequence of SEQ ID NO:X wherein X is any integer
as defined in Table 1; and a nucleotide sequence encoded by a human cDNA clone
identified by a cDNA Clone Identifier in Table 1 and contained in the deposit with the
ATCC Deposit Number shown for said cDNA clone in Table 1; which method
comprises a step of comparing a nucleotide sequence of at least one nucleic acid
molecule in said sample with a sequence selected from said group and determining
whether the sequence of said nucleic acid molecule in said sample is at least 95%
identical to said selected sequence.

Also preferred is the above method wherein said step of comparing sequences 
comprises determining the extent of nucleic acid hybridization between nucleic acid 
molecules in said sample and a nucleic acid molecule comprising said sequence selected 
from said group. Similarly, also preferred is the above method wherein said step of 
comparing sequences is performed by comparing the nucleotide sequence determined
from a nucleic acid molecule in said sample with said sequence selected from said 
group. The nucleic acid molecules can comprise DNA molecules or RNA molecules. 

A further preferred embodiment is a method for identifying the species, tissue or 
cell type of a biological sample which method comprises a step of detecting nucleic acid 
molecules in said sample, if any, comprising a nucleotide sequence that is at least 95%
identical to a sequence of at least 50 contiguous nucleotides in a sequence selected from
the group consisting of: a nucleotide sequence of SEQ ID NO:X wherein X is any
integer as defined in Table 1; and a nucleotide sequence encoded by a human cDNA
clone identified by a cDNA Clone Identifier in Table 1 and contained in the deposit with
the ATCC Deposit Number shown for said cDNA clone in Table 1.

The method for identifying the species, tissue or cell type of a biological sample 
can comprise a step of detecting nucleic acid molecules comprising a nucleotide 
sequence in a panel of at least two nucleotide sequences, wherein at least one sequence 
in said panel is at least 95% identical to a sequence of at least 50 contiguous nucleotides 
in a sequence selected from said group. 

Also preferred is a method for diagnosing in a subject a pathological condition 
associated with abnormal structure or expression of a gene encoding a secreted protein 
identified in Table 1, which method comprises a step of detecting in a biological sample
obtained from said subject nucleic acid molecules, if any, comprising a nucleotide
sequence that is at least 95% identical to a sequence of at least 50 contiguous
nucleotides in a sequence selected from the group consisting of: a nucleotide sequence
of SEQ ID NO:X wherein X is any integer as defined in Table 1; and a nucleotide
sequence encoded by a human cDNA clone identified by a cDNA Clone Identifier in
Table 1 and contained in the deposit with the ATCC Deposit Number shown for said
cDNA clone in Table 1.

The method for diagnosing a pathological condition can comprise a step of 
detecting nucleic acid molecules comprising a nucleotide sequence in a panel of at least 
two nucleotide sequences, wherein at least one sequence in said panel is at least 95% 
identical to a sequence of at least 50 contiguous nucleotides in a sequence selected from 
said group. 

Also preferred is a composition of matter comprising isolated nucleic acid 
molecules wherein the nucleotide sequences of said nucleic acid molecules comprise a 
panel of at least two nucleotide sequences, wherein at least one sequence in said panel is 
at least 95% identical to a sequence of at least 50 contiguous nucleotides in a sequence 
selected from the group consisting of: a nucleotide sequence of SEQ ID NO:X wherein 
X is any integer as defined in Table 1; and a nucleotide sequence encoded by a human
cDNA clone identified by a cDNA Clone Identifier in Table 1 and contained in the 
deposit with the ATCC Deposit Number shown for said cDNA clone in Table 1. The 
nucleic acid molecules can comprise DNA molecules or RNA molecules. 

Also preferred is an isolated polypeptide comprising an amino acid sequence at 
least 90% identical to a sequence of at least about 10 contiguous amino acids in the 
amino acid sequence of SEQ ID NO:Y wherein Y is any integer as defined in Table 1.

Also preferred is a polypeptide, wherein said sequence of contiguous amino 
acids is included in the amino acid sequence of SEQ ID NO:Y in the range of positions 
beginning with the residue at about the position of the First Amino Acid of the Secreted
Portion and ending with the residue at about the Last Amino Acid of the Open Reading 
Frame as set forth for SEQ ID NO:Y in Table 1. 
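
The range described here, from the First Amino Acid of the Secreted Portion to the Last Amino Acid of the Open Reading Frame, amounts to a 1-based inclusive slice of the full-length sequence. The Python sketch below shows that slice under stated assumptions: the 16-residue sequence and the positions 11 and 16 are invented for illustration and do not correspond to any entry in Table 1, and the function name is hypothetical.

```python
def secreted_portion(protein: str, first_secreted: int, last_orf: int) -> str:
    """Return residues first_secreted..last_orf (1-based, inclusive)."""
    if not (1 <= first_secreted <= last_orf <= len(protein)):
        raise ValueError("positions out of range for this sequence")
    # Convert the 1-based inclusive range to Python's 0-based half-open slice.
    return protein[first_secreted - 1:last_orf]


# Invented example: residues 1-10 stand in for a signal peptide.
full_length = "MKTLLLAVAVGQDSPE"
print(secreted_portion(full_length, 11, 16))
```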

Also preferred is an isolated polypeptide comprising an amino acid sequence at 
least 95% identical to a sequence of at least about 30 contiguous amino acids in the
amino acid sequence of SEQ ID NO:Y.

Further preferred is an isolated polypeptide comprising an amino acid sequence 
at least 95% identical to a sequence of at least about 100 contiguous amino acids in the 
amino acid sequence of SEQ ID NO:Y.

Further preferred is an isolated polypeptide comprising an amino acid sequence
at least 95% identical to the complete amino acid sequence of SEQ ID NO:Y.

Further preferred is an isolated polypeptide comprising an amino acid sequence 
at least 90% identical to a sequence of at least about 10 contiguous amino acids in the
complete amino acid sequence of a secreted protein encoded by a human cDNA clone
identified by a cDNA Clone Identifier in Table 1 and contained in the deposit with the
ATCC Deposit Number shown for said cDNA clone in Table 1.

Also preferred is a polypeptide wherein said sequence of contiguous amino 
acids is included in the amino acid sequence of a secreted portion of the secreted protein 
encoded by a human cDNA clone identified by a cDNA Clone Identifier in Table 1 and
contained in the deposit with the ATCC Deposit Number shown for said cDNA clone in
Table 1.

Also preferred is an isolated polypeptide comprising an amino acid sequence at 
least 95% identical to a sequence of at least about 30 contiguous amino acids in the 
amino acid sequence of the secreted portion of the protein encoded by a human cDNA
clone identified by a cDNA Clone Identifier in Table 1 and contained in the deposit with
the ATCC Deposit Number shown for said cDNA clone in Table 1.

Also preferred is an isolated polypeptide comprising an amino acid sequence at 
least 95% identical to a sequence of at least about 100 contiguous amino acids in the 
amino acid sequence of the secreted portion of the protein encoded by a human cDNA
clone identified by a cDNA Clone Identifier in Table 1 and contained in the deposit with
the ATCC Deposit Number shown for said cDNA clone in Table 1.

Also preferred is an isolated polypeptide comprising an amino acid sequence at 
least 95% identical to the amino acid sequence of the secreted portion of the protein 
encoded by a human cDNA clone identified by a cDNA Clone Identifier in Table 1 and
contained in the deposit with the ATCC Deposit Number shown for said cDNA clone in
Table 1.

Further preferred is an isolated antibody which binds specifically to a
polypeptide comprising an amino acid sequence that is at least 90% identical to a
sequence of at least 10 contiguous amino acids in a sequence selected from the group
consisting of: an amino acid sequence of SEQ ID NO:Y wherein Y is any integer as
defined in Table 1; and a complete amino acid sequence of a protein encoded by a
human cDNA clone identified by a cDNA Clone Identifier in Table 1 and contained in
the deposit with the ATCC Deposit Number shown for said cDNA clone in Table 1.

Further preferred is a method for detecting in a biological sample a polypeptide
comprising an amino acid sequence which is at least 90% identical to a sequence of at
least 10 contiguous amino acids in a sequence selected from the group consisting of: an
amino acid sequence of SEQ ID NO:Y wherein Y is any integer as defined in Table 1;
and a complete amino acid sequence of a protein encoded by a human cDNA clone
identified by a cDNA Clone Identifier in Table 1 and contained in the deposit with the
ATCC Deposit Number shown for said cDNA clone in Table 1; which method
comprises a step of comparing an amino acid sequence of at least one polypeptide
molecule in said sample with a sequence selected from said group and determining
whether the sequence of said polypeptide molecule in said sample is at least 90%
identical to said sequence of at least 10 contiguous amino acids.

Also preferred is the above method wherein said step of comparing an amino
acid sequence of at least one polypeptide molecule in said sample with a sequence
selected from said group comprises determining the extent of specific binding of
polypeptides in said sample to an antibody which binds specifically to a polypeptide
comprising an amino acid sequence that is at least 90% identical to a sequence of at least
10 contiguous amino acids in a sequence selected from the group consisting of: an
amino acid sequence of SEQ ID NO:Y wherein Y is any integer as defined in Table 1;
and a complete amino acid sequence of a protein encoded by a human cDNA clone
identified by a cDNA Clone Identifier in Table 1 and contained in the deposit with the
ATCC Deposit Number shown for said cDNA clone in Table 1.

Also preferred is the above method wherein said step of comparing sequences is
performed by comparing the amino acid sequence determined from a polypeptide
molecule in said sample with said sequence selected from said group.

Also preferred is a method for identifying the species, tissue or cell type of a
biological sample which method comprises a step of detecting polypeptide molecules in
said sample, if any, comprising an amino acid sequence that is at least 90% identical to
a sequence of at least 10 contiguous amino acids in a sequence selected from the group
consisting of: an amino acid sequence of SEQ ID NO:Y wherein Y is any integer as
defined in Table 1; and a complete amino acid sequence of a secreted protein encoded
by a human cDNA clone identified by a cDNA Clone Identifier in Table 1 and contained
in the deposit with the ATCC Deposit Number shown for said cDNA clone in Table 1.

Also preferred is the above method for identifying the species, tissue or cell type 
of a biological sample, which method comprises a step of detecting polypeptide 
molecules comprising an amino acid sequence in a panel of at least two amino acid
sequences, wherein at least one sequence in said panel is at least 90% identical to a 
sequence of at least 10 contiguous amino acids in a sequence selected from the above 
group. 

Also preferred is a method for diagnosing in a subject a pathological condition
associated with abnormal structure or expression of a gene encoding a secreted protein
identified in Table 1, which method comprises a step of detecting in a biological sample
obtained from said subject polypeptide molecules comprising an amino acid sequence in
a panel of at least two amino acid sequences, wherein at least one sequence in said panel
is at least 90% identical to a sequence of at least 10 contiguous amino acids in a
sequence selected from the group consisting of: an amino acid sequence of SEQ ID
NO:Y wherein Y is any integer as defined in Table 1; and a complete amino acid
sequence of a secreted protein encoded by a human cDNA clone identified by a cDNA
Clone Identifier in Table 1 and contained in the deposit with the ATCC Deposit Number
shown for said cDNA clone in Table 1.

In any of these methods, the step of detecting said polypeptide molecules
includes using an antibody.

Also preferred is an isolated nucleic acid molecule comprising a nucleotide
sequence which is at least 95% identical to a nucleotide sequence encoding a
polypeptide wherein said polypeptide comprises an amino acid sequence that is at least
90% identical to a sequence of at least 10 contiguous amino acids in a sequence selected
from the group consisting of: an amino acid sequence of SEQ ID NO:Y wherein Y is
any integer as defined in Table 1; and a complete amino acid sequence of a secreted
protein encoded by a human cDNA clone identified by a cDNA Clone Identifier in Table
1 and contained in the deposit with the ATCC Deposit Number shown for said cDNA
clone in Table 1.

Also preferred is an isolated nucleic acid molecule, wherein said nucleotide 
sequence encoding a polypeptide has been optimized for expression of said polypeptide 
in a prokaryotic host. 
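
One simple way to picture optimizing a coding sequence for a prokaryotic host is to back-translate the polypeptide using a single preferred codon per amino acid. The Python sketch below does this under loud assumptions: the codon table is an illustrative choice of one valid codon per residue, loosely modeled on codons often described as frequent in E. coli, and is not taken from this specification. The names `PREFERRED_CODON` and `back_translate` are hypothetical, and practical optimization would also weigh factors such as mRNA structure and restriction sites.

```python
# Illustrative one-codon-per-residue table (an assumption, not from the source).
PREFERRED_CODON = {
    "A": "GCG", "R": "CGT", "N": "AAC", "D": "GAT", "C": "TGC",
    "Q": "CAG", "E": "GAA", "G": "GGC", "H": "CAT", "I": "ATT",
    "L": "CTG", "K": "AAA", "M": "ATG", "F": "TTT", "P": "CCG",
    "S": "AGC", "T": "ACC", "W": "TGG", "Y": "TAT", "V": "GTG",
    "*": "TAA",  # stop codon
}


def back_translate(protein: str) -> str:
    """Encode a protein (one-letter codes, '*' for stop) with the table above."""
    codons = []
    for residue in protein.upper():
        if residue not in PREFERRED_CODON:
            raise ValueError(f"unsupported residue: {residue!r}")
        codons.append(PREFERRED_CODON[residue])
    return "".join(codons)


# Made-up tetrapeptide followed by a stop codon.
print(back_translate("MKL*"))
```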

Also preferred is an isolated nucleic acid molecule, wherein said polypeptide
comprises an amino acid sequence selected from the group consisting of: an amino acid
sequence of SEQ ID NO:Y wherein Y is any integer as defined in Table 1; and a
complete amino acid sequence of a secreted protein encoded by a human cDNA clone
identified by a cDNA Clone Identifier in Table 1 and contained in the deposit with the
ATCC Deposit Number shown for said cDNA clone in Table 1.

Further preferred is a method of making a recombinant vector comprising 
inserting any of the above isolated nucleic acid molecules into a vector. Also preferred is
the recombinant vector produced by this method. Also preferred is a method of making
a recombinant host cell comprising introducing the vector into a host cell, as well as the 
recombinant host cell produced by this method. 

Also preferred is a method of making an isolated polypeptide comprising
culturing this recombinant host cell under conditions such that said polypeptide is
expressed and recovering said polypeptide. Also preferred is this method of making an
isolated polypeptide, wherein said recombinant host cell is a eukaryotic cell and said
polypeptide is a secreted portion of a human secreted protein comprising an amino acid
sequence selected from the group consisting of: an amino acid sequence of SEQ ID
NO:Y beginning with the residue at the position of the First Amino Acid of the Secreted
Portion of SEQ ID NO:Y wherein Y is an integer set forth in Table 1 and said position
of the First Amino Acid of the Secreted Portion of SEQ ID NO:Y is defined in Table 1;
and an amino acid sequence of a secreted portion of a protein encoded by a human
cDNA clone identified by a cDNA Clone Identifier in Table 1 and contained in the
deposit with the ATCC Deposit Number shown for said cDNA clone in Table 1. The
isolated polypeptide produced by this method is also preferred.

Also preferred is a method of treatment of an individual in need of an increased 
level of a secreted protein activity, which method comprises administering to such an 
individual a pharmaceutical composition comprising an amount of an isolated 
polypeptide, polynucleotide, or antibody of the claimed invention effective to increase
the level of said protein activity in said individual.

Having generally described the invention, the same will be more readily 
understood by reference to the following examples, which are provided by way of 
illustration and are not intended as limiting. 

Examples

Example 1: Isolation of a Selected cDNA Clone From the Deposited
Sample

Each cDNA clone in a cited ATCC deposit is contained in a plasmid vector.
Table 1 identifies the vectors used to construct the cDNA library from which each clone
was isolated. In many cases, the vector used to construct the library is a phage vector
from which a plasmid has been excised. The table immediately below correlates the
related plasmid for each phage vector used in constructing the cDNA library. For
example, where a particular clone is identified in Table 1 as being isolated in the vector
"Lambda Zap," the corresponding deposited clone is in "pBluescript."

Vector Used to Construct Library      Corresponding Deposited Plasmid
Lambda Zap                            pBluescript (pBS)
Uni-Zap XR                            pBluescript (pBS)
Zap Express                           pBK
lafmid BA                             plafmid BA
pSport1                               pSport1
pCMVSport 2.0                         pCMVSport 2.0
pCMVSport 3.0                         pCMVSport 3.0
pCR®2.1                               pCR®2.1

Vectors Lambda Zap (U.S. Patent Nos. 5,128,256 and 5,286,636), Uni-Zap
XR (U.S. Patent Nos. 5,128,256 and 5,286,636), Zap Express (U.S. Patent Nos.
5,128,256 and 5,286,636), pBluescript (pBS) (Short, J. M. et al., Nucleic Acids Res.
16:7583-7600 (1988); Alting-Mees, M. A. and Short, J. M., Nucleic Acids Res.
17:9494 (1989)) and pBK (Alting-Mees, M. A. et al., Strategies 5:58-61 (1992)) are
commercially available from Stratagene Cloning Systems, Inc., 11011 N. Torrey Pines
Road, La Jolla, CA, 92037. pBS contains an ampicillin resistance gene and pBK
contains a neomycin resistance gene. Both can be transformed into E. coli strain XL-1
Blue, also available from Stratagene. pBS comes in 4 forms: SK+, SK-, KS+ and KS-.
The S and K refer to the orientation of the polylinker to the T7 and T3 primer
sequences which flank the polylinker region ("S" is for SacI and "K" is for KpnI, which
are the first sites on each respective end of the linker). "+" or "-" refer to the orientation
of the f1 origin of replication ("ori"), such that in one orientation, single stranded rescue
initiated from the f1 ori generates sense strand DNA and in the other, antisense.

Vectors pSport1, pCMVSport 2.0 and pCMVSport 3.0 were obtained from
Life Technologies, Inc., P. O. Box 6009, Gaithersburg, MD 20897. All Sport vectors
contain an ampicillin resistance gene and may be transformed into E. coli strain
DH10B, also available from Life Technologies. (See, for instance, Gruber, C. E., et
al., Focus 15:59 (1993).) Vector lafmid BA (Bento Soares, Columbia University, NY)
contains an ampicillin resistance gene and can be transformed into E. coli strain XL-1
Blue. Vector pCR®2.1, which is available from Invitrogen, 1600 Faraday Avenue,
Carlsbad, CA 92008, contains an ampicillin resistance gene and may be transformed
into E. coli strain DH10B, available from Life Technologies. (See, for instance, Clark,
J. M., Nuc. Acids Res. 16:9677-9686 (1988) and Mead, D. et al., Bio/Technology 9:
(1991).) Preferably, a polynucleotide of the present invention does not comprise the
phage vector sequences identified for the particular clone in Table 1, as well as the
corresponding plasmid vector sequences designated above.

The deposited material in the sample assigned the ATCC Deposit Number cited 
in Table 1 for any given cDNA clone also may contain one or more additional plasmids, 
each comprising a cDNA clone different from that given clone. Thus, deposits sharing 
the same ATCC Deposit Number contain at least a plasmid for each cDNA clone 
identified in Table 1. Typically, each ATCC deposit sample cited in Table 1 comprises 
a mixture of approximately equal amounts (by weight) of about 50 plasmid DNAs, each 
containing a different cDNA clone; but such a deposit sample may include plasmids for 
more or fewer than 50 cDNA clones, up to about 500 cDNA clones. 

Two approaches can be used to isolate a particular clone from the deposited 
sample of plasmid DNAs cited for that clone in Table 1. First, a plasmid is directly 
isolated by screening the clones using a polynucleotide probe corresponding to SEQ ID 
NO:X. 

Particularly, a specific polynucleotide with 30-40 nucleotides is synthesized 
using an Applied Biosystems DNA synthesizer according to the sequence reported. 
The oligonucleotide is labeled, for instance, with ³²P-γ-ATP using T4 polynucleotide 
kinase and purified according to routine methods. (E.g., Maniatis et al., Molecular 
Cloning: A Laboratory Manual, Cold Spring Harbor Press, Cold Spring, NY (1982).) 
The plasmid mixture is transformed into a suitable host, as indicated above (such as 
XL-1 Blue (Stratagene)) using techniques known to those of skill in the art, such as 
those provided by the vector supplier or in related publications or patents cited above. 
The transformants are plated on 1.5% agar plates (containing the appropriate selection 
agent, e.g., ampicillin) to a density of about 150 transformants (colonies) per plate. 
These plates are screened using Nylon membranes according to routine methods for 
bacterial colony screening (e.g., Sambrook et al., Molecular Cloning: A Laboratory 
Manual, 2nd Edit., (1989), Cold Spring Harbor Laboratory Press, pages 1.93 to 
1.104), or other techniques known to those of skill in the art. 
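
As a rough guide to how many colonies must be screened before a given clone is likely to appear at least once, the following sketch treats each transformant as an independent draw from the plasmid mixture. It assumes that every plasmid in the deposit is equally represented by number and transforms with equal efficiency, which "equal amounts by weight" only approximates; the function name and the 99% target are illustrative and are not part of the described method. 

```python
import math


def colonies_needed(n_clones: int, confidence: float = 0.99) -> int:
    """Colonies to screen so that a given clone appears at least once.

    Assumes each colony carries one of n_clones plasmids with equal
    probability (an idealization of the deposited mixture).
    """
    if n_clones < 1 or not 0.0 < confidence < 1.0:
        raise ValueError("need n_clones >= 1 and 0 < confidence < 1")
    if n_clones == 1:
        return 1
    # P(clone missed in N colonies) = (1 - 1/n)^N
    # Solve (1 - 1/n)^N <= 1 - confidence for N.
    miss_one = 1.0 - 1.0 / n_clones
    return math.ceil(math.log(1.0 - confidence) / math.log(miss_one))


if __name__ == "__main__":
    for n in (50, 500):
        total = colonies_needed(n)
        plates = math.ceil(total / 150)  # about 150 colonies per plate
        print(f"{n} clones in deposit: {total} colonies, {plates} plate(s)")
```

Under these assumptions a 50-clone deposit calls for roughly 230 colonies, i.e. two plates at the stated density; unequal representation in a real sample would raise that number. 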

Alternatively, two primers of 17-20 nucleotides derived from both ends of the 
SEQ ID NO:X (i.e., within the region of SEQ ID NO:X bounded by the 5' NT and the 
3' NT of the clone defined in Table 1) are synthesized and used to amplify the desired 
cDNA using the deposited cDNA plasmid as a template. The polymerase chain reaction 
is carried out under routine conditions, for instance, in 25 µl of reaction mixture with 
0.5 µg of the above cDNA template. A convenient reaction mixture is 1.5-5 mM 
MgCl2, 0.01% (w/v) gelatin, 20 µM each of dATP, dCTP, dGTP, dTTP, 25 pmol of 
each primer and 0.25 Unit of Taq polymerase. Thirty five cycles of PCR (denaturation 
at 94°C for 1 min; annealing at 55°C for 1 min; elongation at 72°C for 1 min) are 
performed with a Perkin-Elmer Cetus automated thermal cycler. The amplified product 
is analyzed by agarose gel electrophoresis and the DNA band with expected molecular 
weight is excised and purified. The PCR product is verified to be the selected sequence 
by subcloning and sequencing the DNA product. 
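
The choice of the two end primers can be illustrated with a short sketch. It simply takes the first bases of the bounded region as the forward primer and the reverse complement of the last bases as the reverse primer; it does not check melting temperature, secondary structure or specificity, which a practical design would also consider. The function names and the 1-based coordinate convention are assumptions made for the illustration. 

```python
COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]


def end_primers(seq: str, start: int, end: int, length: int = 20):
    """Return (forward, reverse) primers for seq[start..end].

    start and end are 1-based and inclusive, in the manner of the
    5' NT and 3' NT positions listed for a clone. Both primers are
    written 5' to 3'.
    """
    if not 17 <= length <= 20:
        raise ValueError("primer length should be 17-20 nucleotides")
    region = seq[start - 1:end].upper()
    if len(region) < 2 * length:
        raise ValueError("region too short for two non-overlapping primers")
    forward = region[:length]                       # matches the sense strand
    reverse = reverse_complement(region[-length:])  # anneals to the sense strand
    return forward, reverse


if __name__ == "__main__":
    toy = "A" * 20 + "C" * 10 + "G" * 20  # placeholder, not a real cDNA
    print(end_primers(toy, 1, len(toy)))
```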

Several methods are available for the identification of the 5' or 3' non-coding 
portions of a gene which may not be present in the deposited clone. These methods 
include, but are not limited to, filter probing, clone enrichment using specific probes, 
and protocols similar or identical to 5' and 3' "RACE" protocols which are well known 
in the art. For instance, a method similar to 5' RACE is available for generating the 
missing 5' end of a desired full-length transcript. (Fromont-Racine et al., Nucleic Acids 
Res. 21(7):1683-1684 (1993).) 

Briefly, a specific RNA oligonucleotide is ligated to the 5' ends of a population 
of RNA presumably containing full-length gene RNA transcripts. A primer set 
containing a primer specific to the ligated RNA oligonucleotide and a primer specific to 
a known sequence of the gene of interest is used to PCR amplify the 5' portion of the 
desired full-length gene. This amplified product may then be sequenced and used to 
generate the full length gene. 

The above method starts with total RNA isolated from the desired source, 
although poly-A+ RNA can be used. The RNA preparation can then be treated with 
phosphatase if necessary to eliminate 5' phosphate groups on degraded or damaged 
RNA which may interfere with the later RNA ligase step. The phosphatase should then 
be inactivated and the RNA treated with tobacco acid pyrophosphatase in order to 
remove the cap structure present at the 5' ends of messenger RNAs. This reaction 
leaves a 5' phosphate group at the 5' end of the cap cleaved RNA which can then be 
ligated to an RNA oligonucleotide using T4 RNA ligase. 

This modified RNA preparation is used as a template for first strand cDNA 
synthesis using a gene specific oligonucleotide. The first strand synthesis reaction is 
used as a template for PCR amplification of the desired 5' end using a primer specific to 
the ligated RNA oligonucleotide and a primer specific to the known sequence of the 
gene of interest. The resultant product is then sequenced and analyzed to confirm that 
the 5' end sequence belongs to the desired gene. 

Example 2: Isolation of Genomic Clones Corresponding to a Polynucleotide 

A human genomic P1 library (Genomic Systems, Inc.) is screened by PCR 
using primers selected for the cDNA sequence corresponding to SEQ ID NO:X, 
according to the method described in Example 1. (See also, Sambrook.) 

Example 3: Tissue Distribution of Polypeptide 

Tissue distribution of mRNA expression of polynucleotides of the present 
invention is determined using protocols for Northern blot analysis, described by, 
among others, Sambrook et al. For example, a cDNA probe produced by the method 
described in Example 1 is labeled with ³²P using the rediprime™ DNA labeling system 
(Amersham Life Science), according to manufacturer's instructions. After labeling, the 
probe is purified using a CHROMA SPIN-100™ column (Clontech Laboratories, Inc.), 
according to manufacturer's protocol number PT1200-1. The purified labeled probe is 
then used to examine various human tissues for mRNA expression. 

Multiple Tissue Northern (MTN) blots containing various human tissues (H) or 
human immune system tissues (IM) (Clontech) are examined with the labeled probe 
using ExpressHyb™ hybridization solution (Clontech) according to manufacturer's 
protocol number PT1190-1. Following hybridization and washing, the blots are 
mounted and exposed to film at -70°C overnight, and the films developed according to 
standard procedures. 

Example 4: Chromosomal Mapping of the Polynucleotides 

An oligonucleotide primer set is designed according to the sequence at the 5' 
end of SEQ ID NO:X. This primer set preferably spans about 100 nucleotides. This 
primer set is then used in a polymerase chain reaction under the following set of 
conditions: 30 seconds, 95°C; 1 minute, 56°C; 1 minute, 70°C. This cycle is repeated 
32 times followed by one 5 minute cycle at 70°C. Human, mouse, and hamster DNA 
is used as template in addition to a somatic cell hybrid panel containing individual 
chromosomes or chromosome fragments (Bios, Inc). The reactions are analyzed on 
either 8% polyacrylamide gels or 3.5% agarose gels. Chromosome mapping is 
determined by the presence of an approximately 100 bp PCR fragment in the particular 
somatic cell hybrid. 
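
The logic of reading a somatic cell hybrid panel can be sketched as a concordance test: a chromosome is a candidate only if the approximately 100 bp product appears in every hybrid that retains that chromosome and in no hybrid that lacks it. The panel composition, hybrid names and results below are invented for illustration and do not describe the commercial panel; real panels also require allowance for chromosome fragments and occasional false results. 

```python
def assign_chromosome(panel, pcr_positive):
    """Return chromosomes whose presence matches the PCR results exactly.

    panel        -- dict: hybrid name -> set of human chromosomes retained
    pcr_positive -- dict: hybrid name -> True if the ~100 bp band was seen
    """
    all_chromosomes = set().union(*panel.values())
    candidates = []
    for chrom in sorted(all_chromosomes):
        # Concordant when "hybrid retains chrom" equals "band observed"
        # for every hybrid in the panel.
        if all((chrom in panel[h]) == pcr_positive[h] for h in panel):
            candidates.append(chrom)
    return candidates


if __name__ == "__main__":
    # Hypothetical four-hybrid panel, for illustration only.
    panel = {
        "hybrid_A": {"1", "7", "X"},
        "hybrid_B": {"7", "12"},
        "hybrid_C": {"1", "12"},
        "hybrid_D": {"X"},
    }
    results = {"hybrid_A": True, "hybrid_B": True,
               "hybrid_C": False, "hybrid_D": False}
    print(assign_chromosome(panel, results))
```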

Example 5: Bacterial Expression of a Polypeptide 

A polynucleotide encoding a polypeptide of the present invention is amplified 
using PCR oligonucleotide primers corresponding to the 5' and 3' ends of the DNA 
sequence, as outlined in Example 1, to synthesize insertion fragments. The primers 
used to amplify the cDNA insert should preferably contain restriction sites, such as 
BamHI and XbaI, at the 5' end of the primers in order to clone the amplified product 
into the expression vector. For example, BamHI and XbaI correspond to the restriction 
enzyme sites on the bacterial expression vector pQE-9 (Qiagen, Inc., Chatsworth, 
CA). This plasmid vector encodes antibiotic resistance (Ampr), a bacterial origin of 
replication (ori), an IPTG-regulatable promoter/operator (P/O), a ribosome binding site 
(RBS), a 6-histidine tag (6-His), and restriction enzyme cloning sites. 

The pQE-9 vector is digested with BamHI and XbaI and the amplified fragment 
is ligated into the pQE-9 vector maintaining the reading frame initiated at the bacterial 
RBS. The ligation mixture is then used to transform the E. coli strain M15/rep4 
(Qiagen, Inc.) which contains multiple copies of the plasmid pREP4, which expresses 
the lacI repressor and also confers kanamycin resistance (Kanr). Transformants are 
identified by their ability to grow on LB plates and ampicillin/kanamycin resistant 
colonies are selected. Plasmid DNA is isolated and confirmed by restriction analysis. 

Clones containing the desired constructs are grown overnight (O/N) in liquid 
culture in LB media supplemented with both Amp (100 µg/ml) and Kan (25 µg/ml). 
The O/N culture is used to inoculate a large culture at a ratio of 1:100 to 1:250. The 
cells are grown to an optical density 600 (O.D.600) of between 0.4 and 0.6. IPTG 
(isopropyl-β-D-thiogalactopyranoside) is then added to a final concentration of 1 mM. 
IPTG induces by inactivating the lacI repressor, clearing the P/O, leading to increased 
gene expression. 
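
The inoculation ratio and the IPTG addition reduce to simple dilution arithmetic, sketched below. The sketch reads "1:100" as one volume of overnight culture per 100 volumes of final culture, and it assumes a 1 M IPTG stock; neither the stock concentration nor the 1 liter culture volume is specified above, so both are illustrative. 

```python
def inoculum_volume_ml(culture_ml: float, dilution: float) -> float:
    """Overnight culture needed for a 1:dilution inoculation."""
    return culture_ml / dilution


def stock_volume_ml(culture_ml: float, final_mM: float, stock_mM: float) -> float:
    """Stock volume v giving final_mM after it is added to culture_ml.

    Solves stock_mM * v = final_mM * (culture_ml + v) for v.
    """
    if stock_mM <= final_mM:
        raise ValueError("stock must be more concentrated than the target")
    return final_mM * culture_ml / (stock_mM - final_mM)


if __name__ == "__main__":
    culture = 1000.0  # ml, illustrative culture volume
    low = inoculum_volume_ml(culture, 250)
    high = inoculum_volume_ml(culture, 100)
    iptg = stock_volume_ml(culture, final_mM=1.0, stock_mM=1000.0)  # assumed 1 M stock
    print(f"inoculum: {low:.1f}-{high:.1f} ml; IPTG stock: {iptg:.3f} ml")
```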

Cells are grown for an extra 3 to 4 hours. Cells are then harvested by 
centrifugation (20 mins at 6000 x g). The cell pellet is solubilized in the chaotropic 
agent 6 Molar Guanidine HCl by stirring for 3-4 hours at 4°C. The cell debris is 
removed by centrifugation, and the supernatant containing the polypeptide is loaded 
onto a nickel-nitrilo-tri-acetic acid ("Ni-NTA") affinity resin column (available from 
QIAGEN, Inc., supra). Proteins with a 6 x His tag bind to the Ni-NTA resin with high 
affinity and can be purified in a simple one-step procedure (for details see: The 
QIAexpressionist (1995) QIAGEN, Inc., supra). 

Briefly, the supernatant is loaded onto the column in 6 M guanidine-HCl, pH 8. 
The column is first washed with 10 volumes of 6 M guanidine-HCl, pH 8, then washed 
with 10 volumes of 6 M guanidine-HCl, pH 6, and finally the polypeptide is eluted with 
6 M guanidine-HCl, pH 5. 

The purified protein is then renatured by dialyzing it against phosphate-buffered 
saline (PBS) or 50 mM Na-acetate, pH 6 buffer plus 200 mM NaCl. Alternatively, the 
protein can be successfully refolded while immobilized on the Ni-NTA column. The 
recommended conditions are as follows: renature using a linear 6M-1M urea gradient in 
500 mM NaCl, 20% glycerol, 20 mM Tris/HCl pH 7.4, containing protease inhibitors. 
The renaturation should be performed over a period of 1.5 hours or more. After 
renaturation the proteins are eluted by the addition of 250 mM imidazole. Imidazole 
is removed by a final dialyzing step against PBS or 50 mM sodium acetate pH 6 buffer 
plus 200 mM NaCl. The purified protein is stored at 4°C or frozen at -80°C. 

In addition to the above expression vector, the present invention further includes 
an expression vector comprising phage operator and promoter elements operatively 
linked to a polynucleotide of the present invention, called pHE4a. (ATCC Accession 
Number 209645, deposited on February 25, 1998.) This vector contains: 1) a 
neomycin phosphotransferase gene as a selection marker, 2) an E. coli origin of 
replication, 3) a T5 phage promoter sequence, 4) two lac operator sequences, 5) a 
Shine-Dalgarno sequence, and 6) the lactose operon repressor gene (lacIq). The origin 
of replication (oriC) is derived from pUC19 (LTI, Gaithersburg, MD). The promoter 
sequence and operator sequences are made synthetically. 

DNA can be inserted into the pHE4a by restricting the vector with NdeI and 
XbaI, BamHI, XhoI, or Asp718, running the restricted product on a gel, and isolating 
the larger fragment (the stuffer fragment should be about 310 base pairs). The DNA 
insert is generated according to the PCR protocol described in Example 1, using PCR 
primers having restriction sites for NdeI (5' primer) and XbaI, BamHI, XhoI, or 
Asp718 (3' primer). The PCR insert is gel purified and restricted with compatible 
enzymes. The insert and vector are ligated according to standard protocols. 
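
Before a 3' enzyme is chosen, the insert can be checked for internal copies of the candidate recognition sites, since an internal site would be cut along with the primer-encoded site. The sketch below uses the standard recognition sequences for the enzymes named above (Asp718 recognizes the same sequence as KpnI); the function names and the toy insert are illustrative only. 

```python
# Recognition sequences (5' to 3'); all are palindromic, so one strand suffices.
SITES = {
    "NdeI": "CATATG",
    "XbaI": "TCTAGA",
    "BamHI": "GGATCC",
    "XhoI": "CTCGAG",
    "Asp718": "GGTACC",
}


def internal_sites(insert: str) -> dict:
    """Map each enzyme to the 1-based start positions of its site in insert."""
    seq = insert.upper()
    return {
        name: [i + 1 for i in range(len(seq) - len(site) + 1)
               if seq.startswith(site, i)]
        for name, site in SITES.items()
    }


def usable_3prime_enzymes(insert: str) -> list:
    """3' enzymes that do not cut inside the insert (NdeI must not cut either)."""
    hits = internal_sites(insert)
    if hits["NdeI"]:
        raise ValueError("insert contains an internal NdeI site")
    return [e for e in ("XbaI", "BamHI", "XhoI", "Asp718") if not hits[e]]


if __name__ == "__main__":
    toy_insert = "ATGGCCGGATCCAAATTTCTCGAGTAA"  # placeholder, not a real cDNA
    print(internal_sites(toy_insert))
    print(usable_3prime_enzymes(toy_insert))
```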

The engineered vector could easily be substituted in the above protocol to 
express protein in a bacterial system. 

Example 6: Purification of a Polypeptide from an Inclusion Body 

The following alternative method can be used to purify a polypeptide expressed 
in E. coli when it is present in the form of inclusion bodies. Unless otherwise specified, 
all of the following steps are conducted at 4-10°C. 

Upon completion of the production phase of the E. coli fermentation, the cell 
culture is cooled to 4-10°C and the cells harvested by continuous centrifugation at 
15,000 rpm (Heraeus Sepatech). On the basis of the expected yield of protein per unit 
weight of cell paste and the amount of purified protein required, an appropriate amount 
of cell paste, by weight, is suspended in a buffer solution containing 100 mM Tris, 50 
mM EDTA, pH 7.4. The cells are dispersed to a homogeneous suspension using a 
high shear mixer. 

The cells are then lysed by passing the solution through a microfluidizer 
(Microfluidics Corp. or APV Gaulin, Inc.) twice at 4000-6000 psi. The homogenate is 
then mixed with NaCl solution to a final concentration of 0.5 M NaCl, followed by 
centrifugation at 7000 x g for 15 min. The resultant pellet is washed again using 0.5 M 
NaCl, 100 mM Tris, 50 mM EDTA, pH 7.4. 

The resulting washed inclusion bodies are solubilized with 1.5 M guanidine 
hydrochloride (GuHCl) for 2-4 hours. After 7000 x g centrifugation for 15 min., the 
pellet is discarded and the polypeptide containing supernatant is incubated at 4°C 
overnight to allow further GuHCl extraction. 

Following high speed centrifugation (30,000 x g) to remove insoluble particles, 
the GuHCl solubilized protein is refolded by quickly mixing the GuHCl extract with 20 
volumes of buffer containing 50 mM sodium, pH 4.5, 150 mM NaCl, 2 mM EDTA by 
vigorous stirring. The refolded diluted protein solution is kept at 4°C without mixing 
for 12 hours prior to further purification steps. 
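
The effect of the 20-volume dilution on the residual denaturant can be estimated with a one-line calculation. The sketch assumes the extract is at the 1.5 M GuHCl used for solubilization, that the diluent contains no GuHCl, and that volumes are additive; the 50 ml extract volume is illustrative. 

```python
def diluted_concentration(conc: float, diluent_volumes: float) -> float:
    """Concentration after mixing 1 volume of sample with N volumes of diluent."""
    if diluent_volumes < 0:
        raise ValueError("diluent_volumes must be non-negative")
    return conc / (1.0 + diluent_volumes)


if __name__ == "__main__":
    extract_ml = 50.0                                  # illustrative extract volume
    final_M = diluted_concentration(1.5, 20)           # 1.5 M GuHCl, 20 volumes of buffer
    total_ml = extract_ml * (1 + 20)
    print(f"residual GuHCl ~{final_M:.3f} M in {total_ml:.0f} ml")
```

Under these assumptions the GuHCl falls to roughly 0.07 M, about a 21-fold reduction. 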

To clarify the refolded polypeptide solution, a previously prepared tangential 
filtration unit equipped with 0.16 µm membrane filter with appropriate surface area 
(e.g., Filtron), equilibrated with 40 mM sodium acetate, pH 6.0 is employed. The 
filtered sample is loaded onto a cation exchange resin (e.g., Poros HS-50, Perseptive 
Biosystems). The column is washed with 40 mM sodium acetate, pH 6.0 and eluted 
with 250 mM, 500 mM, 1000 mM, and 1500 mM NaCl in the same buffer, in a 
stepwise manner. The absorbance at 280 nm of the effluent is continuously monitored. 
Fractions are collected and further analyzed by SDS-PAGE. 

Fractions containing the polypeptide are then pooled and mixed with 4 volumes 
of water. The diluted sample is then loaded onto a previously prepared set of tandem 
columns of strong anion (Poros HQ-50, Perseptive Biosystems) and weak anion 
(Poros CM-20, Perseptive Biosystems) exchange resins. The columns are equilibrated 
with 40 mM sodium acetate, pH 6.0. Both columns are washed with 40 mM sodium 
acetate, pH 6.0, 200 mM NaCl. The CM-20 column is then eluted using a 10 column 
volume linear gradient ranging from 0.2 M NaCl, 50 mM sodium acetate, pH 6.0 to 1.0 
M NaCl, 50 mM sodium acetate, pH 6.5. Fractions are collected under constant A280 
monitoring of the effluent. Fractions containing the polypeptide (determined, for 
instance, by 16% SDS-PAGE) are then pooled. 

The resultant polypeptide should exhibit greater than 95% purity after the above 
refolding and purification steps. No major contaminant bands should be observed from 
Coomassie blue stained 16% SDS-PAGE gel when 5 µg of purified protein is loaded. 
The purified protein can also be tested for endotoxin/LPS contamination, and typically 
the LPS content is less than 0.1 ng/ml according to LAL assays. 
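
For reference, the NaCl concentration at any point in the 10 column volume linear gradient used to elute the CM-20 column follows from linear interpolation between the start and end buffers. The sketch below assumes an ideal gradient with no mixing delay or dead volume; the function name is illustrative. 

```python
def nacl_at(cv: float, start_M: float = 0.2, end_M: float = 1.0,
            gradient_cv: float = 10.0) -> float:
    """NaCl molarity after cv column volumes of an ideal linear gradient."""
    if not 0.0 <= cv <= gradient_cv:
        raise ValueError("cv must lie within the gradient")
    return start_M + (end_M - start_M) * cv / gradient_cv


if __name__ == "__main__":
    for cv in (0, 2.5, 5, 7.5, 10):
        print(f"{cv:>4} CV: {nacl_at(cv):.2f} M NaCl")
```

A fraction eluting halfway through the gradient therefore corresponds to about 0.6 M NaCl under these idealized conditions. 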

Example 7: Cloning and Expression of a Polypeptide in a Baculovirus Expression System 

In this example, the plasmid shuttle vector pA2 is used to insert a polynucleotide 
into a baculovirus to express a polypeptide. This expression vector contains the strong 
polyhedrin promoter of the Autographa californica nuclear polyhedrosis virus 
(AcMNPV) followed by convenient restriction sites such as BamHI, XbaI and 
Asp718. The polyadenylation site of the simian virus 40 ("SV40") is used for efficient 
polyadenylation. For easy selection of recombinant virus, the plasmid contains the 
beta-galactosidase gene from E. coli under control of a weak Drosophila promoter in the 
same orientation, followed by the polyadenylation signal of the polyhedrin gene. The 
inserted genes are flanked on both sides by viral sequences for cell-mediated 
homologous recombination with wild-type viral DNA to generate a viable virus that 
expresses the cloned polynucleotide. 

Many other baculovirus vectors can be used in place of the vector above, such 
as pAc373, pVL941, and pAcIM1, as one skilled in the art would readily appreciate, as 
long as the construct provides appropriately located signals for transcription, 
translation, secretion and the like, including a signal peptide and an in-frame AUG as 
required. Such vectors are described, for instance, in Luckow et al., Virology 170:31-39 
(1989). 

Specifically, the cDNA sequence contained in the deposited clone, including the 
AUG initiation codon and the naturally associated leader sequence identified in Table 1, 
is amplified using the PCR protocol described in Example 1. If the naturally occurring 
signal sequence is used to produce the secreted protein, the pA2 vector does not need a 
second signal peptide. Alternatively, the vector can be modified (pA2 GP) to include a 
baculovirus leader sequence, using the standard methods described in Summers et al., 
"A Manual of Methods for Baculovirus Vectors and Insect Cell Culture Procedures," 
Texas Agricultural Experimental Station Bulletin No. 1555 (1987). 

The amplified fragment is isolated from a 1% agarose gel using a commercially 
available kit ("Geneclean," BIO 101 Inc., La Jolla, Ca.). The fragment then is digested 
with appropriate restriction enzymes and again purified on a 1% agarose gel. 

The plasmid is digested with the corresponding restriction enzymes and, 
optionally, can be dephosphorylated using calf intestinal phosphatase, using routine 
procedures known in the art. The DNA is then isolated from a 1% agarose gel using a 
commercially available kit ("Geneclean," BIO 101 Inc., La Jolla, Ca.). 

The fragment and the dephosphorylated plasmid are ligated together with T4 
DNA ligase. E. coli HB101 or other suitable E. coli hosts such as XL-1 Blue 
(Stratagene Cloning Systems, La Jolla, CA) cells are transformed with the ligation 
mixture and spread on culture plates. Bacteria containing the plasmid are identified by 
digesting DNA from individual colonies and analyzing the digestion product by gel 
electrophoresis. The sequence of the cloned fragment is confirmed by DNA 
sequencing. 

Five µg of a plasmid containing the polynucleotide is co-transfected with 1.0 µg 
of a commercially available linearized baculovirus DNA ("BaculoGold™ baculovirus 
DNA", Pharmingen, San Diego, CA), using the lipofection method described by 
Felgner et al., Proc. Natl. Acad. Sci. USA 84:7413-7417 (1987). One µg of 
BaculoGold™ virus DNA and 5 µg of the plasmid are mixed in a sterile well of a 
microtiter plate containing 50 µl of serum-free Grace's medium (Life Technologies 
Inc., Gaithersburg, MD). Afterwards, 10 µl Lipofectin plus 90 µl Grace's medium are 
added, mixed and incubated for 15 minutes at room temperature. Then the transfection 
mixture is added drop-wise to Sf9 insect cells (ATCC CRL 1711) seeded in a 35 mm 
tissue culture plate with 1 ml Grace's medium without serum. The plate is then 
incubated for 5 hours at 27°C. The transfection solution is then removed from the plate 
and 1 ml of Grace's insect medium supplemented with 10% fetal calf serum is added. 
Cultivation is then continued at 27°C for four days. 

After four days the supernatant is collected and a plaque assay is performed, as 
described by Summers and Smith, supra. An agarose gel with "Blue Gal" (Life 
Technologies Inc., Gaithersburg) is used to allow easy identification and isolation of 
gal-expressing clones, which produce blue-stained plaques. (A detailed description of a 
"plaque assay" of this type can also be found in the user's guide for insect cell culture 
and baculovirology distributed by Life Technologies Inc., Gaithersburg, pages 9-10.) 
After appropriate incubation, blue stained plaques are picked with the tip of a 
micropipettor (e.g., Eppendorf). The agar containing the recombinant viruses is then 
resuspended in a microcentrifuge tube containing 200 µl of Grace's medium and the 
suspension containing the recombinant baculovirus is used to infect Sf9 cells seeded in 
35 mm dishes. Four days later the supernatants of these culture dishes are harvested 
and then they are stored at 4°C. 

To verify the expression of the polypeptide, Sf9 cells are grown in Grace's 
medium supplemented with 10% heat-inactivated FBS. The cells are infected with the 
recombinant baculovirus containing the polynucleotide at a multiplicity of infection 
("MOI") of about 2. If radiolabeled proteins are desired, 6 hours later the medium is 
removed and is replaced with SF900 II medium minus methionine and cysteine 
(available from Life Technologies Inc., Rockville, MD). After 42 hours, 5 µCi of ³⁵S-methionine 
and 5 µCi ³⁵S-cysteine (available from Amersham) are added. The cells are 
further incubated for 16 hours and then are harvested by centrifugation. The proteins 
in the supernatant as well as the intracellular proteins are analyzed by SDS-PAGE 
followed by autoradiography (if radiolabeled). 
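
The volume of virus stock needed for a given MOI, and the fraction of cells expected to receive at least one infectious particle, can be estimated as sketched below. The cell number and the virus titer are hypothetical values chosen for illustration, and the infected fraction relies on the usual Poisson assumption that particles distribute randomly among cells. 

```python
import math


def virus_volume_ml(cells: float, moi: float, titer_pfu_per_ml: float) -> float:
    """Volume of virus stock that delivers moi plaque-forming units per cell."""
    if titer_pfu_per_ml <= 0:
        raise ValueError("titer must be positive")
    return moi * cells / titer_pfu_per_ml


def fraction_infected(moi: float) -> float:
    """Poisson estimate of cells receiving at least one infectious particle."""
    return 1.0 - math.exp(-moi)


if __name__ == "__main__":
    cells = 1e6    # hypothetical number of Sf9 cells in the dish
    titer = 1e8    # hypothetical stock titer, pfu/ml
    print(f"virus stock: {virus_volume_ml(cells, 2, titer) * 1000:.0f} µl")
    print(f"infected fraction at MOI 2: {fraction_infected(2):.3f}")
```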

Microsequencing of the amino acid sequence of the amino terminus of purified 
protein may be used to determine the amino terminal sequence of the produced 
protein. 

Example 8: Expression of a Polypeptide in Mammalian Cells 

The polypeptide of the present invention can be expressed in a mammalian cell. 
A typical mammalian expression vector contains a promoter element, which mediates 
the initiation of transcription of mRNA, a protein coding sequence, and signals required 
for the termination of transcription and polyadenylation of the transcript. Additional 
elements include enhancers, Kozak sequences and intervening sequences flanked by 
donor and acceptor sites for RNA splicing. Highly efficient transcription is achieved 
with the early and late promoters from SV40, the long terminal repeats (LTRs) from 
retroviruses, e.g., RSV, HTLV-I, HIV-I, and the early promoter of the cytomegalovirus 
(CMV). However, cellular elements can also be used (e.g., the human actin promoter). 

Suitable expression vectors for use in practicing the present invention include, 
for example, vectors such as pSVL and pMSG (Pharmacia, Uppsala, Sweden), 
pRSVcat (ATCC 37152), pSV2dhfr (ATCC 37146), pBC12MI (ATCC 67109), 
pCMVSport 2.0, and pCMVSport 3.0. Mammalian host cells that could be used 
include human HeLa, 293, H9 and Jurkat cells, mouse NIH3T3 and C127 cells, Cos 1, 
Cos 7 and CV1, quail QC1-3 cells, mouse L cells and Chinese hamster ovary (CHO) 
cells. 

Alternatively, the polypeptide can be expressed in stable cell lines containing the 
polynucleotide integrated into a chromosome. The co-transfection with a selectable 
marker such as dhfr, gpt, neomycin, or hygromycin allows the identification and isolation 
of the transfected cells. 

The transfected gene can also be amplified to express large amounts of the 
encoded protein. The DHFR (dihydrofolate reductase) marker is useful in developing 
cell lines that carry several hundred or even several thousand copies of the gene of 
interest. (See, e.g., Alt, F. W., et al., J. Biol. Chem. 253:1357-1370 (1978); Hamlin, 
J. L. and Ma, C., Biochem. et Biophys. Acta, 1097:107-143 (1990); Page, M. J. and 
Sydenham, M. A., Biotechnology 9:64-68 (1991).) Another useful selection marker is 
the enzyme glutamine synthase (GS) (Murphy et al., Biochem J. 227:277-279 (1991); 
Bebbington et al., Bio/Technology 10:169-175 (1992)). Using these markers, the 
mammalian cells are grown in selective medium and the cells with the highest resistance 
are selected. These cell lines contain the amplified gene(s) integrated into a 
chromosome. Chinese hamster ovary (CHO) and NS0 cells are often used for the 
production of proteins. 

Derivatives of the plasmid pSV2-dhfr (ATCC Accession No. 37146), the 
expression vectors pC4 (ATCC Accession No. 209646) and pC6 (ATCC Accession 
No. 209647) contain the strong promoter (LTR) of the Rous Sarcoma Virus (Cullen et 
al., Molecular and Cellular Biology, 438-447 (March, 1985)) plus a fragment of the 
CMV-enhancer (Boshart et al., Cell 41:521-530 (1985)). Multiple cloning sites, e.g., 
with the restriction enzyme cleavage sites BamHI, XbaI and Asp718, facilitate the 
cloning of the gene of interest. The vectors also contain the 3' intron, the 
polyadenylation and termination signal of the rat preproinsulin gene, and the mouse 
DHFR gene under control of the SV40 early promoter. 

Specifically, the plasmid pC6, for example, is digested with appropriate 
restriction enzymes and then dephosphorylated using calf intestinal phosphatase by 
procedures known in the art. The vector is then isolated from a 1% agarose gel. 

A polynucleotide of the present invention is amplified according to the protocol 
outlined in Example 1. If the naturally occurring signal sequence is used to produce the 
secreted protein, the vector does not need a second signal peptide. Alternatively, if the 
naturally occurring signal sequence is not used, the vector can be modified to include a 
heterologous signal sequence. (See, e.g., WO 96/34891.) 

The amplified fragment is isolated from a 1% agarose gel using a commercially 
available kit ("Geneclean," BIO 101 Inc., La Jolla, Ca.). The fragment then is digested 
with appropriate restriction enzymes and again purified on a 1% agarose gel. 

The amplified fragment is then digested with the same restriction enzyme and 
purified on a 1% agarose gel. The isolated fragment and the dephosphorylated vector 
are then ligated with T4 DNA ligase. E. coli HB101 or XL-1 Blue cells are then 
transformed and bacteria are identified that contain the fragment inserted into plasmid 
pC6 using, for instance, restriction enzyme analysis. 

Chinese hamster ovary cells lacking an active DHFR gene are used for 
transfection. Five µg of the expression plasmid pC6 is cotransfected with 0.5 µg of the 
plasmid pSVneo using lipofectin (Felgner et al., supra). The plasmid pSV2-neo 
contains a dominant selectable marker, the neo gene from Tn5 encoding an enzyme that 
confers resistance to a group of antibiotics including G418. The cells are seeded in 
alpha minus MEM supplemented with 1 mg/ml G418. After 2 days, the cells are 
trypsinized and seeded in hybridoma cloning plates (Greiner, Germany) in alpha minus 
MEM supplemented with 10, 25, or 50 ng/ml of methotrexate plus 1 mg/ml G418. 
After about 10-14 days single clones are trypsinized and then seeded in 6-well petri 
dishes or 10 ml flasks using different concentrations of methotrexate (50 nM, 100 nM, 
200 nM, 400 nM, 800 nM). Clones growing at the highest concentrations of 
methotrexate are then transferred to new 6-well plates containing even higher 
concentrations of methotrexate (1 µM, 2 µM, 5 µM, 10 µM, 20 µM). The same 
procedure is repeated until clones are obtained which grow at a concentration of 
100-200 µM. Expression of the desired gene product is analyzed, for instance, by SDS-PAGE 
and Western blot or by reversed phase HPLC analysis. 
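
Because the first selection step is stated in ng/ml and the later steps in molar units, it can help to place both on one scale. The sketch below converts mass concentration to nanomolar using a molar mass of about 454.44 g/mol for methotrexate; that value is supplied here as an assumption for the illustration and is not taken from the protocol above. 

```python
METHOTREXATE_G_PER_MOL = 454.44  # approximate molar mass, assumed for illustration


def ng_per_ml_to_nM(ng_per_ml: float,
                    g_per_mol: float = METHOTREXATE_G_PER_MOL) -> float:
    """Convert a mass concentration in ng/ml to nanomolar."""
    # ng/ml equals µg/l; µg/l divided by g/mol gives µmol/l; x1000 gives nmol/l.
    return ng_per_ml / g_per_mol * 1000.0


if __name__ == "__main__":
    for dose in (10, 25, 50):
        print(f"{dose} ng/ml ~ {ng_per_ml_to_nM(dose):.0f} nM")
```

On this basis the initial 10-50 ng/ml selection corresponds to roughly 20-110 nM, which overlaps the low end of the 50-800 nM series used in the following step. 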

Example 9; Protein Fusions 



25 These fusion proteins can be used for a variety of applications. For example, fusion of 
the present polypeptides to His-tag, HA-tag, protein A, IgG domains, and maltose 
binding protein facilitates purification. (See Example 5; see also EP A 394,827; 
Traunecker, ct al.. Nature 331:84-86 (1988).) Similarly, fusion to IgG-1, IgG-3, and 
albumin increases the halflife time in vivo. Nuclear localization signals fused to the 

30 polypeptides of the present invention can target the protein to a specific subcellular 
localization, while covalent heterodimer or homodimers can increase or decrease the 
activity of a fusion protein. Fusion proteins can also create chimeric molecules having 
more than one function. Finally, fusion proteins can increase solubility and/or stabihty 
of the fused protein compared to the non-fused protein. All of the types of fusion 

35 proteins described above can be made by modifying the following protocol, which 
outlines the fusion of a polypeptide to an IgG molecule, or the protocol described in 
Example 5. 



The polypeptides of the present invention are preferably fused to other proteins. 
Briefly, the human Fc portion of the IgG molecule can be PCR amplified, using
primers that span the 5' and 3' ends of the sequence described below. These primers
also should have convenient restriction enzyme sites that will facilitate cloning into an
expression vector, preferably a mammalian expression vector.

For example, if pC4 (Accession No. 209646) is used, the human Fc portion can
be ligated into the BamHI cloning site. Note that the 3' BamHI site should be
destroyed. Next, the vector containing the human Fc portion is re-restricted with
BamHI, linearizing the vector, and a polynucleotide of the present invention, isolated
by the PCR protocol described in Example 1, is ligated into this BamHI site. Note that
the polynucleotide is cloned without a stop codon, otherwise a fusion protein will not
be produced.
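
The requirement that the insert carry no stop codon can be checked before cloning. The following is a minimal sketch, assuming the insert is supplied as a plain DNA string that begins at the first base of the reading frame; the function names and the toy sequences are hypothetical and are not taken from this specification.

```python
# Sketch only: screens an open reading frame for in-frame stop codons
# before it is joined upstream of the Fc portion.
STOP_CODONS = {"TAA", "TAG", "TGA"}


def inframe_stops(orf: str) -> list[int]:
    """Return 0-based codon indices of stop codons, reading from base 0."""
    seq = orf.upper()
    return [i // 3 for i in range(0, len(seq) - 2, 3) if seq[i:i + 3] in STOP_CODONS]


def ready_for_fc_fusion(orf: str) -> bool:
    """True if the length is a multiple of three and no in-frame stop is present."""
    return len(orf) % 3 == 0 and not inframe_stops(orf)


# Hypothetical toy inserts (not sequences from this document).
with_stop = "ATGGCTAAATGA"   # Met-Ala-Lys-stop
without_stop = "ATGGCTAAA"   # Met-Ala-Lys

print(inframe_stops(with_stop))           # codon index of the stop
print(ready_for_fc_fusion(without_stop))  # suitable for read-through into Fc
```

An insert that fails this check would terminate translation before the Fc portion, which is the outcome the note above warns against.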

If the naturally occurring signal sequence is used to produce the secreted
protein, pC4 does not need a second signal peptide. Alternatively, if the naturally
occurring signal sequence is not used, the vector can be modified to include a
heterologous signal sequence. (See, e.g., WO 96/34891.)

Human IgG Fc region:

GGGATCCGGAGCCCAAATCTTCTGACAAAACTCACACATGCCCACCGTGCC
CAGCACCTGAATTCGAGGGTGCACCGTCAGTCTTCCTCTTCCCCCCAAAACC
CAAGGACACCCTCATGATCTCCCGGACTCCTGAGGTCACATGCGTGGTGGT
GGACGTAAGCCACGAAGACCCTGAGGTCAAGTTCAACTGGTACGTGGACG
GCGTGGAGGTGCATAATGCCAAGACAAAGCCGCGGGAGGAGCAGTACAAC
AGCACGTACCGTGTGGTCAGCGTCCTCACCGTCCTGCACCAGGACTGGCTG
AATGGCAAGGAGTACAAGTGCAAGGTCTCCAACAAAGCCCTCCCAACCCCC
ATCGAGAAAACCATCTCCAAAGCCAAAGGGCAGCCCCGAGAACCACAGGT
GTACACCCTGCCCCCATCCCGGGATGAGCTGACCAAGAACCAGGTCAGCCT
GACCTGCCTGGTCAAAGGCTTCTATCCAAGCGACATCGCCGTGGAGTGGGA
GAGCAATGGGCAGCCGGAGAACAACTACAAGACCACGCCTCCCGTGCTGG
ACTCCGACGGCTCCTTCTTCCTCTACAGCAAGCTCACCGTGGACAAGAG
GGTGGCAGCAGGGGAACGTCTTCTCATGCTCCGTGATGCATGAGGCTCTGC
ACAACCACTACACGCAGAAGAGCCTCTCCCTGTCTCCGGGTAAATGAGTGC
GACGGCCGCGACTCTAGAGGAT (SEQ ID NO: 1)
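
The note about the 3' BamHI site can be illustrated by scanning the ends of the Fc sequence for the BamHI recognition sequence (GGATCC). This is a sketch under stated assumptions: only the first and last printed lines of SEQ ID NO:1 are used, as transcribed here, and `find_sites` is a hypothetical helper written for this illustration.

```python
# Sketch only: locate every occurrence of a recognition sequence in a DNA string.
def find_sites(seq: str, site: str) -> list[int]:
    """Return 0-based start positions of `site` in `seq`, including overlapping hits."""
    seq, site = seq.upper(), site.upper()
    hits = []
    start = seq.find(site)
    while start != -1:
        hits.append(start)
        start = seq.find(site, start + 1)
    return hits


BAMHI = "GGATCC"

# First and last printed lines of the Fc sequence (SEQ ID NO:1) as transcribed above.
fc_5prime = "GGGATCCGGAGCCCAAATCTTCTGACAAAACTCACACATGCCCACCGTGCC"
fc_3prime = "GACGGCCGCGACTCTAGAGGAT"

print(find_sites(fc_5prime, BAMHI))  # intact site near the 5' end
print(find_sites(fc_3prime, BAMHI))  # no intact site in the 3' end as printed
```

On these two fragments the scan finds a single intact site at the 5' end and none at the 3' end, which is consistent with the vector being re-linearized with BamHI at only one position.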

Example 10: Production of an Antibody from a Polypeptide

The antibodies of the present invention can be prepared by a variety of methods.
(See, Current Protocols, Chapter 2.) For example, cells expressing a polypeptide of
the present invention are administered to an animal to induce the production of sera
containing polyclonal antibodies. In a preferred method, a preparation of the secreted
protein is prepared and purified to render it substantially free of natural contaminants.
Such a preparation is then introduced into an animal in order to produce polyclonal
antisera of greater specific activity.

In the most preferred method, the antibodies of the present invention are
monoclonal antibodies (or protein binding fragments thereof). Such monoclonal
antibodies can be prepared using hybridoma technology. (Kohler et al., Nature
256:495 (1975); Kohler et al., Eur. J. Immunol. 6:511 (1976); Kohler et al., Eur. J.
Immunol. 6:292 (1976); Hammerling et al., in: Monoclonal Antibodies and T-Cell
Hybridomas, Elsevier, N.Y., pp. 563-681 (1981).) In general, such procedures
involve immunizing an animal (preferably a mouse) with polypeptide or, more
preferably, with a secreted polypeptide-expressing cell. Such cells may be cultured in
any suitable tissue culture medium; however, it is preferable to culture cells in Earle's
modified Eagle's medium supplemented with 10% fetal bovine serum (inactivated at
about 56°C), and supplemented with about 10 g/l of nonessential amino acids, about
1,000 U/ml of penicillin, and about 100 µg/ml of streptomycin.

The splenocytes of such mice are extracted and fused with a suitable myeloma
cell line. Any suitable myeloma cell line may be employed in accordance with the
present invention; however, it is preferable to employ the parent myeloma cell line
(SP20), available from the ATCC. After fusion, the resulting hybridoma cells are
selectively maintained in HAT medium, and then cloned by limiting dilution as
described by Wands et al. (Gastroenterology 80:225-232 (1981)). The hybridoma cells
obtained through such a selection are then assayed to identify clones which secrete
antibodies capable of binding the polypeptide.

Alternatively, additional antibodies capable of binding to the polypeptide can be
produced in a two-step procedure using anti-idiotypic antibodies. Such a method
makes use of the fact that antibodies are themselves antigens, and therefore, it is
possible to obtain an antibody which binds to a second antibody. In accordance with
this method, protein specific antibodies are used to immunize an animal, preferably a
mouse. The splenocytes of such an animal are then used to produce hybridoma cells,
and the hybridoma cells are screened to identify clones which produce an antibody
whose ability to bind to the protein-specific antibody can be blocked by the polypeptide.
Such antibodies comprise anti-idiotypic antibodies to the protein-specific antibody and
can be used to immunize an animal to induce formation of further protein-specific
antibodies.

It will be appreciated that Fab and F(ab')2 and other fragments of the antibodies
of the present invention may be used according to the methods disclosed herein. Such
fragments are typically produced by proteolytic cleavage, using enzymes such as papain
(to produce Fab fragments) or pepsin (to produce F(ab')2 fragments). Alternatively,
secreted protein-binding fragments can be produced through the application of
recombinant DNA technology or through synthetic chemistry.

For in vivo use of antibodies in humans, it may be preferable to use
"humanized" chimeric monoclonal antibodies. Such antibodies can be produced using
genetic constructs derived from hybridoma cells producing the monoclonal antibodies
described above. Methods for producing chimeric antibodies are known in the art.
(See, for review, Morrison, Science 229:1202 (1985); Oi et al., BioTechniques 4:214
(1986); Cabilly et al., U.S. Patent No. 4,816,567; Taniguchi et al., EP 171496;
Morrison et al., EP 173494; Neuberger et al., WO 8601533; Robinson et al., WO
8702671; Boulianne et al., Nature 312:643 (1984); Neuberger et al., Nature 314:268
(1985).)



Example 11: Production Of Secreted Protein For High-Throughput 
Screening Assays 

The following protocol produces a supernatant containing a polypeptide to be
tested. This supernatant can then be used in the Screening Assays described in
Examples 13-20.

First, dilute Poly-D-Lysine (644 587 Boehringer-Mannheim) stock solution
(1 mg/ml in PBS) 1:20 in PBS (w/o calcium or magnesium 17-516F Biowhittaker) for a
working solution of 50 ug/ml. Add 200 ul of this solution to each well (24 well plates)
and incubate at RT for 20 minutes. Be sure to distribute the solution over each well
(note: a 12-channel pipetter may be used with tips on every other channel). Aspirate off
the Poly-D-Lysine solution and rinse with 1 ml PBS (Phosphate Buffered Saline). The
PBS should remain in the well until just prior to plating the cells, and plates may be
poly-lysine coated in advance for up to two weeks.
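
The coating arithmetic above can be worked through in a few lines. This is a minimal sketch; the function names are hypothetical, and the only inputs are the values stated in the protocol (1 mg/ml stock, 1:20 dilution, 200 ul per well, 24 wells per plate).

```python
# Sketch only: simple dilution arithmetic for the Poly-D-Lysine coating step.
def dilute(stock_conc: float, dilution_factor: float) -> float:
    """Concentration after diluting a stock 1:dilution_factor."""
    return stock_conc / dilution_factor


def stock_volume_needed(final_volume: float, dilution_factor: float) -> float:
    """Volume of stock required to make `final_volume` of working solution."""
    return final_volume / dilution_factor


working_ug_per_ml = dilute(1000.0, 20)        # 1 mg/ml is 1000 ug/ml
coating_ul_per_plate = 200 * 24               # 200 ul in each of 24 wells
stock_ul_per_plate = stock_volume_needed(coating_ul_per_plate, 20)

print(working_ug_per_ml, coating_ul_per_plate, stock_ul_per_plate)
```

The result agrees with the stated working concentration of 50 ug/ml and shows that one 24-well plate consumes 4.8 ml of working solution, or 240 ul of stock.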

Plate 293T cells (do not carry cells past P+20) at 2 x 10^ cells/well in .5 ml
DMEM (Dulbecco's Modified Eagle Medium) (with 4.5 G/L glucose and L-glutamine
(12-604F Biowhittaker))/10% heat inactivated FBS (14-503F Biowhittaker)/1x
Penstrep (17-602E Biowhittaker). Let the cells grow overnight.

The next day, mix together in a sterile solution basin: 300 ul Lipofectamine
(18324-012 Gibco/BRL) and 5 ml Optimem I (31985070 Gibco/BRL) per 96-well plate.
With a small volume multi-channel pipetter, aliquot approximately 2 ug of an expression
vector containing a polynucleotide insert, produced by the methods described in
Examples 8 or 9, into an appropriately labeled 96-well round bottom plate. With a
multi-channel pipetter, add 50 ul of the Lipofectamine/Optimem I mixture to each well.
Pipette up and down gently to mix. Incubate at RT 15-45 minutes. After about 20
minutes, use a multi-channel pipetter to add 150 ul Optimem I to each well. As a
control, one plate of vector DNA lacking an insert should be transfected with each set of
transfections.
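
The volumes in the transfection mix can be checked for consistency. The sketch below uses only the volumes stated above; the variable names are hypothetical, and the small volume of the DNA aliquot itself is neglected, which is an assumption of this illustration.

```python
# Sketch only: volume bookkeeping for one 96-well plate of transfection complexes.
LIPOFECTAMINE_UL = 300      # per 96-well plate
OPTIMEM_UL = 5000           # 5 ml per 96-well plate
WELLS = 96
MIX_PER_WELL_UL = 50        # Lipofectamine/Optimem I mixture added to each well
OPTIMEM_TOPUP_UL = 150      # Optimem I added after about 20 minutes

master_mix_ul = LIPOFECTAMINE_UL + OPTIMEM_UL
needed_ul = WELLS * MIX_PER_WELL_UL
spare_ul = master_mix_ul - needed_ul

# Final complex volume per well (DNA aliquot volume neglected by assumption).
complex_per_well_ul = MIX_PER_WELL_UL + OPTIMEM_TOPUP_UL

# Lipofectamine actually delivered to each well.
lipofectamine_per_well_ul = MIX_PER_WELL_UL * LIPOFECTAMINE_UL / master_mix_ul

print(master_mix_ul, needed_ul, spare_ul, complex_per_well_ul)
print(round(lipofectamine_per_well_ul, 2))
```

Under these assumptions the master mix covers all 96 wells with 500 ul to spare, and each well ends at 200 ul of complex.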

Preferably, the transfection should be performed by tag-teaming the following
tasks. By tag-teaming, hands on time is cut in half, and the cells do not spend too
much time on PBS. First, person A aspirates off the media from four 24-well plates of
cells, and then person B rinses each well with .5-1 ml PBS. Person A then aspirates off
the PBS rinse, and person B, using a 12-channel pipetter with tips on every other channel,
adds the 200 ul of DNA/Lipofectamine/Optimem I complex to the odd wells first, then to
the even wells, to each row on the 24-well plates. Incubate at 37°C for 6 hours.

While cells are incubating, prepare appropriate media, either 1% BSA in DMEM
with 1x penstrep, or CHO-5 media (116.6 mg/L of CaCl2 (anhyd); 0.00130 mg/L
CuSO4-5H2O; 0.050 mg/L of Fe(NO3)3-9H2O; 0.417 mg/L of FeSO4-7H2O; 311.80
mg/L of KCl; 28.64 mg/L of MgCl2; 48.84 mg/L of MgSO4; 6995.50 mg/L of NaCl;
2400.0 mg/L of NaHCO3; 62.50 mg/L of NaH2PO4-H2O; 71.02 mg/L of Na2HPO4;
.4320 mg/L of ZnSO4-7H2O; .002 mg/L of Arachidonic Acid; 1.022 mg/L of
Cholesterol; .070 mg/L of DL-alpha-Tocopherol-Acetate; 0.0520 mg/L of Linoleic
Acid; 0.010 mg/L of Linolenic Acid; 0.010 mg/L of Myristic Acid; 0.010 mg/L of Oleic
Acid; 0.010 mg/L of Palmitric Acid; 0.010 mg/L of Palmitic Acid; 100 mg/L of
Pluronic F-68; 0.010 mg/L of Stearic Acid; 2.20 mg/L of Tween 80; 4551 mg/L of D-
Glucose; 130.85 mg/ml of L-Alanine; 147.50 mg/ml of L-Arginine-HCL; 7.50 mg/ml
of L-Asparagine-H2O; 6.65 mg/ml of L-Aspartic Acid; 29.56 mg/ml of L-Cystine-
2HCL-H2O; 31.29 mg/ml of L-Cystine-2HCL; 7.35 mg/ml of L-Glutamic Acid; 365.0
mg/ml of L-Glutamine; 18.75 mg/ml of Glycine; 52.48 mg/ml of L-Histidine-HCL-
H2O; 106.97 mg/ml of L-Isoleucine; 111.45 mg/ml of L-Leucine; 163.75 mg/ml of L-
Lysine HCL; 32.34 mg/ml of L-Methionine; 68.48 mg/ml of L-Phenylalanine; 40.0
mg/ml of L-Proline; 26.25 mg/ml of L-Serine; 101.05 mg/ml of L-Threonine; 19.22
mg/ml of L-Tryptophan; 91.79 mg/ml of L-Tyrosine-2Na-2H2O; 99.65 mg/ml of L-
Valine; 0.0035 mg/L of Biotin; 3.24 mg/L of D-Ca Pantothenate; 11.78 mg/L of
Choline Chloride; 4.65 mg/L of Folic Acid; 15.60 mg/L of i-Inositol; 3.02 mg/L of
Niacinamide; 3.00 mg/L of Pyridoxal HCL; 0.031 mg/L of Pyridoxine HCL; 0.319
mg/L of Riboflavin; 3.17 mg/L of Thiamine HCL; 0.365 mg/L of Thymidine; and
0.680 mg/L of Vitamin B12; 25 mM of HEPES Buffer; 2.39 mg/L of Na Hypoxanthine;
0.105 mg/L of Lipoic Acid; 0.081 mg/L of Sodium Putrescine-2HCL; 55.0 mg/L of
Sodium Pyruvate; 0.0067 mg/L of Sodium Selenite; 20 uM of Ethanolamine; 0.122
mg/L of Ferric Citrate; 41.70 mg/L of Methyl-B-Cyclodextrin complexed with Linoleic
Acid; 3333 mg/L of Methyl-B-Cyclodextrin complexed with Oleic Acid; and 10 mg/L
of Methyl-B-Cyclodextrin complexed with Retinal) with 2 mM glutamine and 1x
penstrep. (BSA (81-068-3 Bayer) 100 gm dissolved in 1 L DMEM for a 10% BSA stock
solution). Filter the media and collect 50 ul for endotoxin assay in a 15 ml polystyrene
conical.

The transfection reaction is terminated, preferably by tag-teaming, at the end of
the incubation period. Person A aspirates off the transfection media, while person B
adds 1.5 ml appropriate media to each well. Incubate at 37°C for 45 or 72 hours
depending on the media used: 1% BSA for 45 hours or CHO-5 for 72 hours.

On day four, using a 300 ul multichannel pipetter, aliquot 600 ul in one 1 ml deep
well plate and the remaining supernatant into a 2 ml deep well. The supernatants from
each well can then be used in the assays described in Examples 13-20.

It is specifically understood that when activity is obtained in any of the assays
described below using a supernatant, the activity originates from either the polypeptide
directly (e.g., as a secreted protein) or by the polypeptide inducing expression of other
proteins, which are then secreted into the supernatant. Thus, the invention further
provides a method of identifying the protein in the supernatant characterized by an
activity in a particular assay.

Example 12: Construction of GAS Reporter Construct

One signal transduction pathway involved in the differentiation and proliferation
of cells is called the Jaks-STATs pathway. Activated proteins in the Jaks-STATs
pathway bind to gamma activation site ("GAS") elements or interferon-sensitive
responsive element ("ISRE"), located in the promoter of many genes. The binding of a
protein to these elements alters the expression of the associated gene.

GAS and ISRE elements are recognized by a class of transcription factors called
Signal Transducers and Activators of Transcription, or "STATs." There are six
members of the STATs family. Stat1 and Stat3 are present in many cell types, as is
Stat2 (as response to IFN-alpha is widespread). Stat4 is more restricted and is not in
many cell types though it has been found in T helper class I cells after treatment with
IL-12. Stat5 was originally called mammary growth factor, but has been found at
higher concentrations in other cells including myeloid cells. It can be activated in tissue
culture cells by many cytokines.

The STATs are activated to translocate from the cytoplasm to the nucleus upon
tyrosine phosphorylation by a set of kinases known as the Janus Kinase ("Jaks")
family. Jaks represent a distinct family of soluble tyrosine kinases and include Tyk2,
Jak1, Jak2, and Jak3. These kinases display significant sequence similarity and are
generally catalytically inactive in resting cells.

The Jaks are activated by a wide range of receptors summarized in the Table
below. (Adapted from review by Schindler and Darnell, Ann. Rev. Biochem. 64:621-51
(1995).) A cytokine receptor family, capable of activating Jaks, is divided into two
groups: (a) Class 1 includes receptors for IL-2, IL-3, IL-4, IL-6, IL-7, IL-9, IL-11, IL-
12, IL-15, Epo, PRL, GH, G-CSF, GM-CSF, LIF, CNTF, and thrombopoietin; and
(b) Class 2 includes IFN-a, IFN-g, and IL-10. The Class 1 receptors share a
conserved cysteine motif (a set of four conserved cysteines and one tryptophan) and a
WSXWS motif (a membrane proximal region encoding Trp-Ser-Xxx-Trp-Ser (SEQ ID
NO:2)).

Thus, on binding of a ligand to a receptor, Jaks are activated, which in turn
activate STATs, which then translocate and bind to GAS elements. This entire process
is encompassed in the Jaks-STATs signal transduction pathway.

Therefore, activation of the Jaks-STATs pathway, reflected by the binding of
the GAS or the ISRE element, can be used to indicate proteins involved in the
proliferation and differentiation of cells. For example, growth factors and cytokines are
known to activate the Jaks-STATs pathway. (See Table below.) Thus, by using GAS
elements linked to reporter molecules, activators of the Jaks-STATs pathway can be
identified.
                                  JAKs                     STATS    GAS (elements) or ISRE
Ligand                    tyk2   Jak1   Jak2   Jak3

IFN family
IFN-a/B                   +      +      -      -       1,2,3    ISRE
IFN-g                            +      +      -       1        GAS (IRF1>Lys6>IFP)
IL-10                     +      ?      ?      -       1,3

gp130 family
IL-6 (Pleiotrophic)       +      +      +      ?       1,3      GAS (IRF1>Lys6>IFP)
IL-11 (Pleiotrophic)      ?      +      ?      ?       1,3
OnM (Pleiotrophic)        ?      +      +      ?       1,3
LIF (Pleiotrophic)        ?      +      +      ?       1,3
CNTF (Pleiotrophic)       -/+    +      +      ?       1,3
G-CSF (Pleiotrophic)      ?      +      ?      ?       1,3
IL-12 (Pleiotrophic)      +      -      +      +       1,3

g-C family
IL-2 (lymphocytes)        -      +      -      +       1,3,5    GAS
IL-4 (lymph/myeloid)      -      +      -      +       6        GAS (IRF1 = IFP >>Ly6)(IgH)
IL-7 (lymphocytes)        -      +      -      +       5        GAS
IL-9 (lymphocytes)        -      +      -      +       5        GAS
IL-13 (lymphocyte)        -      +      ?      ?       6        GAS
IL-15                     ?      +      ?      +       5        GAS

gp140 family
IL-3 (myeloid)            -      -      +      -       5        GAS (IRF1>IFP>>Ly6)
IL-5 (myeloid)            -      -      +      -       5        GAS
GM-CSF (myeloid)          -      -      +      -       5        GAS

Growth hormone family
GH                        ?      -      +      -       5
PRL                       ?      +/-    +      -       1,3,5
EPO                       ?      -      +      -       5        GAS (B-CAS>IRF1=IFP>>Ly6)

Receptor Tyrosine Kinases
EGF                       ?      +      +      -       1,3      GAS (IRF1)
PDGF                      ?      +      +      -       1,3
CSF-1                     ?      +      +      -       1,3      GAS (not IRF1)

To construct a synthetic GAS containing promoter element, which is used in the
Biological Assays described in Examples 13-14, a PCR based strategy is employed to
generate a GAS-SV40 promoter sequence. The 5' primer contains four tandem copies
of the GAS binding site found in the IRF1 promoter and previously demonstrated to
bind STATs upon induction with a range of cytokines (Rothman et al., Immunity
1:457-468 (1994)), although other GAS or ISRE elements can be used instead. The 5'
primer also contains 18 bp of sequence complementary to the SV40 early promoter
sequence and is flanked with an XhoI site. The sequence of the 5' primer is:

5':GCGCCTCGAGATTTCCCCGAAATCTAGATTTCCCCGAAATGATTTCCCCG
AAATGATTTCCCCGAAATATCTGCCATCTCAATTAG:3' (SEQ ID NO:3)

The downstream primer is complementary to the SV40 promoter and is flanked
with a Hind III site: 5':GCGGCAAGCTTTTTGCAAAGCCTAGGC:3' (SEQ ID
NO:4)
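
The stated layout of the two primers (restriction site, four tandem GAS copies, SV40-complementary tail) can be verified with simple string operations. This is a sketch under stated assumptions: the primer strings are SEQ ID NO:3 and SEQ ID NO:4 with spacing removed, and the repeated unit `TTCCCCGAA` is the motif read off the primer for this illustration, not a definition given in the specification.

```python
# Sketch only: sanity-check primer design by counting motifs and locating sites.
primer_5 = (
    "GCGCCTCGAGATTTCCCCGAAATCTAGATTTCCCCGAAATGATTTCCCCG"
    "AAATGATTTCCCCGAAATATCTGCCATCTCAATTAG"
)  # SEQ ID NO:3
primer_3 = "GCGGCAAGCTTTTTGCAAAGCCTAGGC"  # SEQ ID NO:4

XHOI = "CTCGAG"
HINDIII = "AAGCTT"
GAS_UNIT = "TTCCCCGAA"  # repeated unit assumed for this check

gas_copies = primer_5.count(GAS_UNIT)      # non-overlapping occurrences
xhoi_position = primer_5.find(XHOI)        # 0-based index, -1 if absent
hindiii_position = primer_3.find(HINDIII)  # 0-based index, -1 if absent
sv40_tail = primer_5[-18:]                 # the 18 bp SV40-complementary stretch

print(gas_copies, xhoi_position, hindiii_position, sv40_tail)
```

The same check could be applied to any substitute GAS or ISRE element before ordering primers.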

PCR amplification is performed using the SV40 promoter template present in
the B-gal:promoter plasmid obtained from Clontech. The resulting PCR fragment is
digested with XhoI/Hind III and subcloned into BLSK2-. (Stratagene.) Sequencing
with forward and reverse primers confirms that the insert contains the following
sequence:

5':CTCGAGATTTCCCCGAAATCTAGATTTCCCCGAAATGATTTCCCCGAAATG
ATTTCCCCGAAATATCTGCCATCTCAATTAGTCAGCAACCATAGTCCCGCCC
CTAACTCCGCCCATCCCGCCCCTAACTCCGCCCAGTTCCGCCCATTCTCCGC
CCCATGGCTGACTAATTTTTTTTATTTATGCAGAGGCCGAGGCCGCCTCGGC
CTCTGAGCTATTCCAGAAGTAGTGAGGAGGCTTTTTTGGAGGCCTAGGCTTT
TGCAAAAAGCTT:3' (SEQ ID NO:5)

With this GAS promoter element linked to the SV40 promoter, a GAS:SEAP2
reporter construct is next engineered. Here, the reporter molecule is a secreted alkaline
phosphatase, or "SEAP." Clearly, however, any reporter molecule can be used instead of
SEAP, in this or in any of the other Examples. Well known reporter molecules that can
be used instead of SEAP include chloramphenicol acetyltransferase (CAT), luciferase,
alkaline phosphatase, B-galactosidase, green fluorescent protein (GFP), or any protein
detectable by an antibody.

The above sequence confirmed synthetic GAS-SV40 promoter element is
subcloned into the pSEAP-Promoter vector obtained from Clontech using HindIII and
XhoI, effectively replacing the SV40 promoter with the amplified GAS:SV40 promoter
element, to create the GAS-SEAP vector. However, this vector does not contain a
neomycin resistance gene, and therefore, is not preferred for mammalian expression
systems.
Thus, in order to generate mammalian stable cell lines expressing the GAS-
SEAP reporter, the GAS-SEAP cassette is removed from the GAS-SEAP vector using
SalI and NotI, and inserted into a backbone vector containing the neomycin resistance
gene, such as pGFP-1 (Clontech), using these restriction sites in the multiple cloning
site, to create the GAS-SEAP/Neo vector. Once this vector is transfected into
mammalian cells, this vector can then be used as a reporter molecule for GAS binding
as described in Examples 13-14.

Other constructs can be made using the above description and replacing GAS
with a different promoter sequence. For example, construction of reporter molecules
containing NF-KB and EGR promoter sequences is described in Examples 15 and 16.
However, many other promoters can be substituted using the protocols described in
these Examples. For instance, SRE, IL-2, NFAT, or Osteocalcin promoters can be
substituted, alone or in combination (e.g., GAS/NF-KB/EGR, GAS/NF-KB, IL-
2/NFAT, or NF-KB/GAS). Similarly, other cell lines can be used to test reporter
construct activity, such as HELA (epithelial), HUVEC (endothelial), Reh (B-cell),
Saos-2 (osteoblast), HUVAC (aortic), or Cardiomyocyte.

Example 13: High-Throughput Screening Assay for T-cell Activity.

The following protocol is used to assess T-cell activity by identifying factors,
such as growth factors and cytokines, that may proliferate or differentiate T-cells. T-
cell activity is assessed using the GAS/SEAP/Neo construct produced in Example 12.
Thus, factors that increase SEAP activity indicate the ability to activate the Jaks-STATS
signal transduction pathway. The T-cell line used in this assay is Jurkat (ATCC
Accession No. TIB-152), although Molt-3 cells (ATCC Accession No. CRL-1552) and
Molt-4 cells (ATCC Accession No. CRL-1582) can also be used.

Jurkat T-cells are lymphoblastic CD4+ Th1 helper cells. In order to generate
stable cell lines, approximately 2 million Jurkat cells are transfected with the GAS-
SEAP/neo vector using DMRIE-C (Life Technologies) (transfection procedure
described below). The transfected cells are seeded to a density of approximately
20,000 cells per well, and transfectants resistant to 1 mg/ml Geneticin are selected. Resistant
colonies are expanded and then tested for their response to increasing concentrations of
interferon gamma. The dose response of a selected clone is demonstrated.

Specifically, the following protocol will yield sufficient cells for 75 wells
containing 200 ul of cells. Thus, it is either scaled up, or performed in multiple to
generate sufficient cells for multiple 96 well plates. Jurkat cells are maintained in RPMI
+ 10% serum with 1% Pen-Strep. Combine 2.5 ml of OPTI-MEM (Life Technologies)
with 10 ug of plasmid DNA in a T25 flask. Add 2.5 ml OPTI-MEM containing 50 ul
of DMRIE-C and incubate at room temperature for 15-45 mins.

During the incubation period, count cell concentration, spin down the required 
number of cells (10^ per transfection), and resuspend in OPTI-MEM to a final 
concentration of 10^ cells/ml. Then add 1ml of 1 x 10^ cells in OPTI-MEM to T25 flask 
and incubate at 37°C for 6 hrs. After the incubation, add 10 ml of RPMI + 15% serum. 

The Jurkat:GAS-SEAP stable reporter lines are maintained in RPMI + 10%
serum, 1 mg/ml Geneticin, and 1% Pen-Strep. These cells are treated with supernatants
containing a polypeptide as produced by the protocol described in Example 11.

On the day of treatment with the supernatant, the cells should be washed and 
resuspended in fresh RPMI + 10% serum to a density of 500,000 cells per ml. The 
exact number of cells required will depend on the number of supernatants being 
screened. For one 96 well plate, approximately 10 million cells (for 10 plates, 100 
million cells) are required. 

Transfer the cells to a triangular reservoir boat, in order to dispense the cells into
a 96 well dish, using a 12 channel pipette. Using a 12 channel pipette, transfer 200 ul
of cells into each well (therefore adding 100,000 cells per well).
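
The seeding numbers above follow directly from the stated density and well volume. The sketch below reproduces that arithmetic with integer microliter volumes; the names are hypothetical, and the inputs are only the values given in the protocol.

```python
# Sketch only: cells per well and per plate from density and dispensed volume.
CELLS_PER_ML = 500_000
WELL_VOLUME_UL = 200
WELLS_PER_PLATE = 96

cells_per_well = CELLS_PER_ML * WELL_VOLUME_UL // 1000
cells_per_plate = cells_per_well * WELLS_PER_PLATE
suspension_ul_per_plate = WELL_VOLUME_UL * WELLS_PER_PLATE


def cells_needed(plates: int) -> int:
    """Cells dispensed into wells for a given number of 96 well plates."""
    return plates * cells_per_plate


print(cells_per_well, cells_per_plate, suspension_ul_per_plate, cells_needed(10))
```

One plate consumes 9.6 million cells in 19.2 ml of suspension, so the figure of approximately 10 million cells per plate leaves a small allowance for dead volume in the reservoir.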

After all the plates have been seeded, 50 ul of the supernatants are transferred
directly from the 96 well plate containing the supernatants into each well using a 12
channel pipette. In addition, a dose of exogenous interferon gamma (0.1, 1.0, 10 ng)
is added to wells H9, H10, and H11 to serve as additional positive controls for the
assay.

The 96 well dishes containing Jurkat cells treated with supernatants are placed in
an incubator for 48 hrs (note: this time is variable between 48-72 hrs). 35 ul samples
from each well are then transferred to an opaque 96 well plate using a 12 channel
pipette. The opaque plates should be covered (using cellophane covers) and stored at
-20°C until SEAP assays are performed according to Example 17. The plates
containing the remaining treated cells are placed at 4°C and serve as a source of material
for repeating the assay on a specific well if desired.

As a positive control, 100 Unit/ml interferon gamma can be used which is 
known to activate Jurkat T cells. Over 30 fold induction is typically observed in the 
positive control wells. 
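
Fold induction is the ratio of the reporter signal in a treated well to the signal in an untreated well. The following is a minimal sketch of that calculation; the well readings, the baseline, and the two-fold hit threshold are hypothetical values chosen for illustration and do not come from this specification.

```python
# Sketch only: fold induction of SEAP signal and a simple hit filter.
def fold_induction(treated: float, untreated: float) -> float:
    """Ratio of treated to untreated reporter signal."""
    if untreated <= 0:
        raise ValueError("untreated signal must be positive")
    return treated / untreated


def flag_hits(readings: dict[str, float], baseline: float,
              threshold: float = 2.0) -> list[str]:
    """Wells whose fold induction meets the (assumed) threshold, in input order."""
    return [well for well, value in readings.items()
            if fold_induction(value, baseline) >= threshold]


# Hypothetical plate readings in arbitrary luminescence units.
readings = {"A1": 120.0, "A2": 450.0, "H9": 3100.0}
baseline = 100.0

print(fold_induction(readings["H9"], baseline))
print(flag_hits(readings, baseline))
```

In this toy data set the well standing in for the interferon gamma control shows a 31-fold induction, in line with the "over 30 fold" figure expected for a working assay.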

The above protocol may be used to generate both transiently and stably
transfected cells, as would be apparent to those of skill in the art.
Example 14: High-Throughput Screening Assay Identifying Myeloid
Activity

The following protocol is used to assess myeloid activity by identifying factors,
such as growth factors and cytokines, that may proliferate or differentiate myeloid cells.
Myeloid cell activity is assessed using the GAS/SEAP/Neo construct produced in
Example 12. Thus, factors that increase SEAP activity indicate the ability to activate the
Jaks-STATS signal transduction pathway. The myeloid cell used in this assay is U937,
a pre-monocyte cell line, although TF-1, HL60, or KG1 can be used.

To transiently transfect U937 cells with the GAS/SEAP/Neo construct produced
in Example 12, a DEAE-Dextran method (Kharbanda et al., 1994, Cell Growth &
Differentiation, 5:259-265) is used. First, harvest 2x10e7 U937 cells and wash with
PBS. The U937 cells are usually grown in RPMI 1640 medium containing 10% heat-
inactivated fetal bovine serum (FBS) supplemented with 100 units/ml penicillin and 100
mg/ml streptomycin.

Next, suspend the cells in 1 ml of 20 mM Tris-HCl (pH 7.4) buffer containing
0.5 mg/ml DEAE-Dextran, 8 ug GAS-SEAP2 plasmid DNA, 140 mM NaCl, 5 mM
KCl, 375 uM Na2HPO4.7H2O, 1 mM MgCl2, and 675 uM CaCl2. Incubate at 37°C
for 45 min.

Wash the cells with RPMI 1640 medium containing 10% FBS and then
resuspend in 10 ml complete medium and incubate at 37°C for 36 hr.

The GAS-SEAP/U937 stable cells are obtained by growing the cells in 400
ug/ml G418. The G418-free medium is used for routine growth but every one to two
months, the cells should be re-grown in 400 ug/ml G418 for a couple of passages.

These cells are tested by harvesting 1x10^8 cells (this is enough for a ten 96-well
plate assay) and washing with PBS. Suspend the cells in 200 ml of the above described growth
medium, with a final density of 5x10^5 cells/ml. Plate 200 ul cells per well in the 96-
well plate (or 1x10^5 cells/well).

Add 50 ul of the supernatant prepared by the protocol described in Example 11.
Incubate at 37°C for 48 to 72 hr. As a positive control, 100 Unit/ml interferon gamma
can be used which is known to activate U937 cells. Over 30 fold induction is typically
observed in the positive control wells. SEAP assay the supernatant according to the
protocol described in Example 17.

Example 15: High-Throughput Screening Assay Identifying Neuronal
Activity.

When cells undergo differentiation and proliferation, a group of genes is
activated through many different signal transduction pathways. One of these genes,
EGR1 (early growth response gene 1), is induced in various tissues and cell types upon
activation. The promoter of EGR1 is responsible for such induction. Using the EGR1
promoter linked to reporter molecules, activation of cells can be assessed.

Particularly, the following protocol is used to assess neuronal activity in PC12
cell lines. PC12 cells (rat pheochromocytoma cells) are known to proliferate and/or
differentiate by activation with a number of mitogens, such as TPA (tetradecanoyl
phorbol acetate), NGF (nerve growth factor), and EGF (epidermal growth factor). The
EGR1 gene expression is activated during this treatment. Thus, by stably transfecting
PC12 cells with a construct containing an EGR promoter linked to SEAP reporter,
activation of PC12 cells can be assessed.

The EGR/SEAP reporter construct can be assembled by the following protocol.
The EGR-1 promoter sequence (-633 to +1) (Sakamoto K et al., Oncogene 6:867-871
(1991)) can be PCR amplified from human genomic DNA using the following primers:
5' GCGCTCGAGGGATGACAGCGATAGAACCCCGG -3' (SEQ ID NO:6)
5' GCGAAGCTTCGCGACTCCCCGGATCCGCCTC-3' (SEQ ID NO:7)
Using the GAS:SEAP/Neo vector produced in Example 12, EGR1 amplified
product can then be inserted into this vector. Linearize the GAS:SEAP/Neo vector
using restriction enzymes XhoI/HindIII, removing the GAS/SV40 stuffer. Restrict the
EGR1 amplified product with these same enzymes. Ligate the vector and the EGR1
promoter.

To prepare 96 well-plates for cell culture, two ml of a coating solution (1:30
dilution of collagen type I (Upstate Biotech Inc. Cat#08-115) in 30% ethanol (filter
sterilized)) is added per one 10 cm plate or 50 ul per well of the 96-well plate, and
allowed to air dry for 2 hr.

PC12 cells are routinely grown in RPMI-1640 medium (Bio Whittaker)
containing 10% horse serum (JRH BIOSCIENCES, Cat. # 12449-78P), 5% heat-
inactivated fetal bovine serum (FBS) supplemented with 100 units/ml penicillin and 100
ug/ml streptomycin on a precoated 10 cm tissue culture dish. A one to four split is done
every three to four days. Cells are removed from the plates by scraping and
resuspended by pipetting up and down more than 15 times.

Transfect the EGR/SEAP/Neo construct into PC12 using the Lipofectamine
protocol described in Example 11. EGR-SEAP/PC12 stable cells are obtained by
growing the cells in 300 ug/ml G418. The G418-free medium is used for routine
growth but every one to two months, the cells should be re-grown in 300 ug/ml G418
for a couple of passages.
To assay for neuronal activity, a 10 cm plate with cells around 70 to 80%
confluent is screened by removing the old medium. Wash the cells once with PBS
(Phosphate buffered saline). Then starve the cells in low serum medium (RPMI-1640
containing 1% horse serum and 0.5% FBS with antibiotics) overnight.

The next morning, remove the medium and wash the cells with PBS. Scrape
off the cells from the plate and suspend the cells well in 2 ml low serum medium. Count
the cell number and add more low serum medium to reach a final cell density of 5x10^
cells/ml.

Add 200 ul of the cell suspension to each well of a 96-well plate (equivalent to
1x10^ cells/well). Add 50 ul of supernatant produced by Example 11 and incubate at 37°C
for 48 to 72 hr. As a positive control, a growth factor known to activate PC12 cells through EGR
can be used, such as 50 ng/ul of Neuronal Growth Factor (NGF). Over fifty-fold
induction of SEAP is typically seen in the positive control wells. SEAP assay the
supernatant according to Example 17.

Example 16: High-Throughput Screening Assay for T-cell Activity

NF-kB (Nuclear Factor kB) is a transcription factor activated by a wide variety
of agents including the inflammatory cytokines IL-1 and TNF, CD30 and CD40,
lymphotoxin-alpha and lymphotoxin-beta, by exposure to LPS or thrombin, and by
expression of certain viral gene products. As a transcription factor, NF-kB regulates
the expression of genes involved in immune cell activation, control of apoptosis (NF-
kB appears to shield cells from apoptosis), B and T-cell development, anti-viral and
antimicrobial responses, and multiple stress responses.

In non-stimulated conditions, NF-kB is retained in the cytoplasm with I-kB
(Inhibitor kB). However, upon stimulation, I-kB is phosphorylated and degraded,
causing NF-kB to shuttle to the nucleus, thereby activating transcription of target
genes. Target genes activated by NF-kB include IL-2, IL-6, GM-CSF, ICAM-1 and
class 1 MHC.

Due to its central role and ability to respond to a range of stimuli, reporter
constructs utilizing the NF-kB promoter element are used to screen the supernatants
produced in Example 11. Activators or inhibitors of NF-kB would be useful in treating
diseases. For example, inhibitors of NF-kB could be used to treat those diseases
related to the acute or chronic activation of NF-kB, such as rheumatoid arthritis.

To construct a vector containing the NF-kB promoter element, a PCR based
strategy is employed. The upstream primer contains four tandem copies of the NF-kB
binding site (GGGGACTTTCCC) (SEQ ID NO:8), 18 bp of sequence complementary
to the 5' end of the SV40 early promoter sequence, and is flanked with an XhoI site:

5':GCGGCCTCGAGGGGACTTTCCCGGGGACTTTCCGGGGACTTTCCGGGACTTTCCATCCTGCCATCTCAATTAG:3' (SEQ ID NO:9)
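
As a quick sanity check on the primer design described above, the printed upstream primer can be scanned for the XhoI recognition sequence (CTCGAG) and for the repeated NF-kB element. This is only an illustrative sketch: the primer string is copied as printed, the 10-nt core GGGACTTTCC is an assumed common core of the printed repeats (not a sequence named in the text), and the helper name is hypothetical.

```python
# Upstream primer copied as printed above (SEQ ID NO:9), 5' to 3'.
UPSTREAM_PRIMER = (
    "GCGGCCTCGAGGGGACTTTCCCGGGGACTTTCCGGGGACTTTCCGGGAC"
    "TTTCCATCCTGCCATCTCAATTAG"
)
XHOI_SITE = "CTCGAG"      # XhoI recognition sequence
NFKB_CORE = "GGGACTTTCC"  # assumed 10-nt core shared by the printed repeats


def find_all(seq, motif):
    """Return every 0-based start position of motif in seq (overlaps allowed)."""
    positions = []
    start = seq.find(motif)
    while start != -1:
        positions.append(start)
        start = seq.find(motif, start + 1)
    return positions


xhoi_positions = find_all(UPSTREAM_PRIMER, XHOI_SITE)
nfkb_positions = find_all(UPSTREAM_PRIMER, NFKB_CORE)
print("XhoI sites at:", xhoi_positions)
print("NF-kB core copies:", len(nfkb_positions))
```

A single XhoI site near the 5' end followed by four copies of the core is what the description of the primer would lead one to expect.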

The downstream primer is complementary to the 3' end of the SV40 promoter
and is flanked with a HindIII site:

5':GCGGCAAGCTTTTTGCAAAGCCTAGGC:3' (SEQ ID NO:4)

PCR amplification is performed using the SV40 promoter template present in
the pB-gal:promoter plasmid obtained from Clontech. The resulting PCR fragment is
digested with XhoI and HindIII and subcloned into BLSK2- (Stratagene).
Sequencing with the T7 and T3 primers confirms the insert contains the following
sequence:

5':GTGGAGGGGACnTCCGGGGGAGTTTGGGGGGACTTTGCGGGACTTTCC 

ATCTGGCATCTGAATTAGTCAGCAACCATAGTCGCGCGGCTAACTGCGGCCA 

TC CCGGGCCT AACTGGGCCCAGTTCGGGGGATTCTCCGGCCCATGGGTGAGT 

AAi 1 1 1 1 1 II ATTTATGGAGAGGGCGAGGCCGGCTCGGGCTCTGAGGTA1TC 

CAGAAGTAGTGAGGAGGGTTmTGGAGGGGTAGGCmTGCAAAAAGCTr- 
3' (SEQ ID NO:10)

Next, replace the SV40 minimal promoter element present in the pSEAP2-promoter
plasmid (Clontech) with this NF-kB/SV40 fragment using XhoI and HindIII.
However, this vector does not contain a neomycin resistance gene, and therefore, is not
preferred for mammalian expression systems.

In order to generate stable mammalian cell lines, the NF-kB/SV40/SEAP
cassette is removed from the above NF-kB/SEAP vector using restriction enzymes SalI
and NotI, and inserted into a vector containing neomycin resistance. Particularly, the
NF-kB/SV40/SEAP cassette was inserted into pGFP-1 (Clontech), replacing the GFP
gene, after restricting pGFP-1 with SalI and NotI.

Once the NF-kB/SV40/SEAP/Neo vector is created, stable Jurkat T-cells are
created and maintained according to the protocol described in Example 13. Similarly,
the method for assaying supernatants with these stable Jurkat T-cells is also described
in Example 13. As a positive control, exogenous TNF alpha (0.1, 1, 10 ng) is added to
wells H9, H10, and H11, with a 5-10 fold activation typically observed.

Example 17: Assay for SEAP Activity

As a reporter molecule for the assays described in Examples 13-16, SEAP
activity is assayed using the Tropix Phospho-light Kit (Cat. BP-400) according to the
following general procedure. The Tropix Phospho-light Kit supplies the Dilution,
Assay, and Reaction Buffers used below.

Prime a dispenser with the 2.5x Dilution Buffer and dispense 15 ul of 2.5x
Dilution Buffer into Optiplates containing 35 ul of a supernatant. Seal the plates with a
plastic sealer and incubate at 65°C for 30 min. Separate the Optiplates to avoid uneven
heating.

Cool the samples to room temperature for 15 minutes. Empty the dispenser and
prime with the Assay Buffer. Add 50 ul Assay Buffer and incubate at room
temperature for 5 min. Empty the dispenser and prime with the Reaction Buffer (see the
table below). Add 50 ul Reaction Buffer and incubate at room temperature for 20
minutes. Since the intensity of the chemiluminescent signal is time dependent, and it
takes about 10 minutes to read 5 plates on the luminometer, one should treat 5 plates at
a time and start the second set 10 minutes later.
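
The staggering rule above (five plates per set, each set started 10 minutes after the previous one) can be written out as a small scheduling sketch. The function name and the extension of the rule to third and later sets are assumptions made for illustration.

```python
def batch_start_times(n_plates, batch_size=5, stagger_min=10):
    """Group plates into sets and assign each set a start offset in minutes.

    Returns a list of (start_minute, [plate numbers]) tuples, with plates
    numbered from 1. Each set starts stagger_min after the previous set.
    """
    schedule = []
    for i, first in enumerate(range(1, n_plates + 1, batch_size)):
        last = min(first + batch_size, n_plates + 1)
        schedule.append((i * stagger_min, list(range(first, last))))
    return schedule


for start, plates in batch_start_times(12):
    print(f"t = {start:>2} min: add Reaction Buffer to plates {plates}")
```

With this spacing, each set reaches the end of its 20 minute incubation just as the luminometer finishes reading the previous set.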

Read the relative light units in the luminometer. Set H12 as blank, and print the
results. An increase in chemiluminescence indicates reporter activity.

Reaction Buffer Formulation:

# of plates    Rxn buffer diluent (ml)    CSPD (ml)
10             60                         3
11             65                         3.25
12             70                         3.5
13             75                         3.75
14             80                         4
15             85                         4.25
16             90                         4.5
17             95                         4.75
18             100                        5
19             105                        5.25
20             110                        5.5
21             115                        5.75
22             120                        6
23             125                        6.25
24             130                        6.5
25             135                        6.75
26             140                        7
27             145                        7.25
28             150                        7.5
29             155                        7.75
30             160                        8
31             165                        8.25
32             170                        8.5
33             175                        8.75
34             180                        9
35             185                        9.25
36             190                        9.5
37             195                        9.75
38             200                        10
39             205                        10.25
40             210                        10.5
41             215                        10.75
42             220                        11
43             225                        11.25
44             230                        11.5
45             235                        11.75
46             240                        12
47             245                        12.25
48             250                        12.5
49             255                        12.75
50             260                        13
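
The Reaction Buffer table follows a simple linear pattern: each additional plate adds 5 ml of diluent, and the CSPD volume is one twentieth of the diluent volume. The sketch below reproduces the tabulated values; the formula is inferred from the table rather than stated in the text, so it is restricted here to the tabulated range of 10 to 50 plates, and the function name is hypothetical.

```python
def reaction_buffer(n_plates):
    """Return (diluent_ml, cspd_ml) for n_plates, per the pattern in the table."""
    if not 10 <= n_plates <= 50:
        raise ValueError("the table only covers 10 to 50 plates")
    diluent_ml = 5 * n_plates + 10   # 60 ml at 10 plates, +5 ml per extra plate
    cspd_ml = diluent_ml / 20        # CSPD is 1/20 of the diluent volume
    return diluent_ml, cspd_ml


for plates in (10, 23, 50):
    diluent, cspd = reaction_buffer(plates)
    print(f"{plates} plates: {diluent} ml diluent, {cspd} ml CSPD")
```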



Example 18: High-Throughput Screening Assay Identifying Changes in
Small Molecule Concentration and Membrane Permeability

Binding of a ligand to a receptor is known to alter intracellular levels of small
molecules, such as calcium, potassium, sodium, and pH, as well as alter membrane
potential. These alterations can be measured in an assay to identify supernatants which
bind to receptors of a particular cell. Although the following protocol describes an
assay for calcium, this protocol can easily be modified to detect changes in potassium,
sodium, pH, membrane potential, or any other small molecule which is detectable by a
fluorescent probe.

The following assay uses a Fluorometric Imaging Plate Reader ("FLIPR") to
measure changes in fluorescent molecules (Molecular Probes) that bind small
molecules. Clearly, any fluorescent molecule detecting a small molecule can be used
instead of the calcium fluorescent molecule, fluo-4 (Molecular Probes, Inc.;
catalog no. F-14202), used here.

For adherent cells, seed the cells at 10,000-20,000 cells/well in a Co-star black
96-well plate with clear bottom. The plate is incubated in a CO2 incubator for 20 hours.
The adherent cells are washed two times in a Biotek washer with 200 ul of HBSS
(Hank's Balanced Salt Solution), leaving 100 ul of buffer after the final wash.

A stock solution of 1 mg/ml fluo-4 is made in 10% pluronic acid DMSO. To
load the cells with fluo-4, 50 ul of 12 ug/ml fluo-4 is added to each well. The plate is
incubated at 37°C in a CO2 incubator for 60 min. The plate is washed four times in the
Biotek washer with HBSS, leaving 100 ul of buffer.

For non-adherent cells, the cells are spun down from culture media. Cells are
re-suspended to 2-5x10^6 cells/ml with HBSS in a 50-ml conical tube. 4 ul of 1 mg/ml
fluo-4 solution in 10% pluronic acid DMSO is added to each ml of cell suspension.
The tube is then placed in a 37°C water bath for 30-60 min. The cells are washed twice
with HBSS, resuspended to 1x10^6 cells/ml, and dispensed into a microplate, 100
ul/well. The plate is centrifuged at 1000 rpm for 5 min. The plate is then washed once
in a Denley CellWash with 200 ul, followed by an aspiration step to 100 ul final volume.

For a non-cell based assay, each well contains a fluorescent molecule, such as
fluo-4. The supernatant is added to the well, and a change in fluorescence is detected.
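
The two dye-loading procedures above arrive at nearly the same final fluo-4 concentration, which can be checked with a simple dilution calculation. This is an illustrative sketch only; the function name is hypothetical, and it assumes the dye is added to the 100 ul of buffer left in the well (adherent cells) or to 1 ml of suspension (non-adherent cells).

```python
def final_conc(stock_conc, stock_vol_ul, existing_vol_ul):
    """Concentration after adding stock_vol_ul of stock to existing_vol_ul."""
    return stock_conc * stock_vol_ul / (stock_vol_ul + existing_vol_ul)


# Adherent cells: 50 ul of 12 ug/ml fluo-4 added to 100 ul of buffer.
adherent = final_conc(12.0, 50.0, 100.0)

# Non-adherent cells: 4 ul of 1 mg/ml (1000 ug/ml) fluo-4 per 1 ml of cells.
suspension = final_conc(1000.0, 4.0, 1000.0)

print(f"adherent: {adherent:.2f} ug/ml, suspension: {suspension:.2f} ug/ml")
```

Both routes give roughly 4 ug/ml fluo-4 during loading under these assumptions.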

To measure the fluorescence of intracellular calcium, the FLIPR is set for the
following parameters: (1) System gain is 300-800 mW; (2) Exposure time is 0.4
second; (3) Camera F/stop is F/2; (4) Excitation is 488 nm; (5) Emission is 530 nm; and
(6) Sample addition is 50 ul. Increased emission at 530 nm indicates an extracellular
signaling event which has resulted in an increase in the intracellular Ca2+
concentration.

Example 19: High-Throughput Screening Assay Identifying Tyrosine
Kinase Activity 

The Protein Tyrosine Kinases (PTK) represent a diverse group of
transmembrane and cytoplasmic kinases. Within the Receptor Protein Tyrosine Kinase
(RPTK) group are receptors for a range of mitogenic and metabolic growth factors
including the PDGF, FGF, EGF, NGF, HGF and Insulin receptor subfamilies. In
addition there is a large family of RPTKs for which the corresponding ligand is
unknown. Ligands for RPTKs include mainly secreted small proteins, but also
membrane-bound and extracellular matrix proteins.

Activation of RPTK by ligands involves ligand-mediated receptor dimerization,
resulting in transphosphorylation of the receptor subunits and activation of the
cytoplasmic tyrosine kinases. The cytoplasmic tyrosine kinases include receptor
associated tyrosine kinases of the src-family (e.g., src, yes, lck, lyn, fyn) and non-receptor
linked and cytosolic protein tyrosine kinases, such as the Jak family, members
of which mediate signal transduction triggered by the cytokine superfamily of receptors
(e.g., the Interleukins, Interferons, GM-CSF, and Leptin).

Because of the wide range of known factors capable of stimulating tyrosine
kinase activity, the identification of novel human secreted proteins capable of activating
tyrosine kinase signal transduction pathways is of interest. Therefore, the following
protocol is designed to identify those novel human secreted proteins capable of
activating the tyrosine kinase signal transduction pathways.

Seed target cells (e.g., primary keratinocytes) at a density of approximately
25,000 cells per well in 96-well Loprodyne Silent Screen Plates purchased from
Nalge Nunc (Naperville, IL). The plates are sterilized with two 30 minute rinses with
100% ethanol, rinsed with water and dried overnight. Some plates are coated for 2 hr
with 100 ul of cell culture grade type I collagen (50 mg/ml), gelatin (2%) or polylysine
(50 mg/ml), all of which can be purchased from Sigma Chemicals (St. Louis, MO), or
10% Matrigel purchased from Becton Dickinson (Bedford, MA), or calf serum, rinsed
with PBS and stored at 4°C. Cell growth on these plates is assayed by seeding 5,000
cells/well in growth medium and indirect quantitation of cell number through use of
alamarBlue as described by the manufacturer Alamar Biosciences, Inc. (Sacramento,
CA) after 48 hr. Falcon plate covers #3071 from Becton Dickinson (Bedford, MA) are
used to cover the Loprodyne Silent Screen Plates. Falcon Microtest III cell culture
plates can also be used in some proliferation experiments.

To prepare extracts, A431 cells are seeded onto the nylon membranes of
Loprodyne plates (20,000 cells/200 ul/well) and cultured overnight in complete medium.
Cells are quiesced by incubation in serum-free basal medium for 24 hr. After 5-20
minutes of treatment with EGF (60 ng/ml) or 50 ul of the supernatant produced in Example
11, the medium is removed and 100 ul of extraction buffer (20 mM HEPES pH
7.5, 0.15 M NaCl, 1% Triton X-100, 0.1% SDS, 2 mM Na3VO4, 2 mM Na4P2O7
and a cocktail of protease inhibitors (#1836170) obtained from Boehringer Mannheim
(Indianapolis, IN)) is added to each well and the plate is shaken on a rotating shaker for
5 minutes at 4°C. The plate is then placed in a vacuum transfer manifold and the extract
filtered through the 0.45 um membrane bottoms of each well using house vacuum.
Extracts are collected in a 96-well catch/assay plate in the bottom of the vacuum
manifold and immediately placed on ice. To obtain extracts clarified by centrifugation,
the content of each well, after detergent solubilization for 5 minutes, is removed and
centrifuged for 15 minutes at 4°C at 16,000 x g.

Test the filtered extracts for levels of tyrosine kinase activity. Although many 
methods of detecting tyrosine kinase activity are known, one method is described here. 



Generally, the tyrosine kinase activity of a supernatant is evaluated by
determining its ability to phosphorylate a tyrosine residue on a specific substrate (a
biotinylated peptide). Biotinylated peptides that can be used for this purpose include
PSK1 (corresponding to amino acids 6-20 of the cell division kinase cdc2-p34) and
PSK2 (corresponding to amino acids 1-17 of gastrin). Both peptides are substrates for
a range of tyrosine kinases and are available from Boehringer Mannheim.

The tyrosine kinase reaction is set up by adding the following components in
order. First, add 10 ul of 5 uM Biotinylated Peptide, then 10 ul ATP/Mg2+ (5 mM
ATP/50 mM MgCl2), then 10 ul of 5x Assay Buffer (40 mM imidazole hydrochloride,
pH 7.3, 40 mM beta-glycerophosphate, 1 mM EGTA, 100 mM MgCl2, 5 mM MnCl2,
0.5 mg/ml BSA), then 5 ul of Sodium Vanadate (1 mM), and then 5 ul of water. Mix the
components gently and preincubate the reaction mix at 30°C for 2 min. Initiate the
reaction by adding 10 ul of the control enzyme or the filtered supernatant.

The tyrosine kinase assay reaction is then terminated by adding 10 ul of 120 mM
EDTA and placing the reactions on ice.
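
The volumes listed for the kinase reaction can be tallied to confirm the final working concentrations. The sketch below is illustrative only; the dictionary keys and function name are hypothetical labels, and the volumes and stock concentrations are those given in the protocol above.

```python
# Component volumes in ul, in the order of addition described above.
REACTION_UL = {
    "biotinylated peptide (5 uM)": 10,
    "ATP/Mg2+": 10,
    "5x Assay Buffer": 10,
    "sodium vanadate (1 mM)": 5,
    "water": 5,
    "enzyme or filtered supernatant": 10,
}


def final_conc(stock_conc, volume_ul, total_ul):
    """Concentration of a component once diluted into total_ul."""
    return stock_conc * volume_ul / total_ul


reaction_total_ul = sum(REACTION_UL.values())
peptide_um = final_conc(5.0, 10, reaction_total_ul)      # uM
buffer_fold = final_conc(5.0, 10, reaction_total_ul)     # x-fold
vanadate_mm = final_conc(1.0, 5, reaction_total_ul)      # mM

# Stopping the reaction adds 10 ul of 120 mM EDTA.
stopped_total_ul = reaction_total_ul + 10
edta_mm = final_conc(120.0, 10, stopped_total_ul)

print(reaction_total_ul, peptide_um, buffer_fold, vanadate_mm, edta_mm)
```

The reaction comes to 50 ul with the Assay Buffer at 1x, and the stopped reaction comes to 60 ul, which leaves enough volume for the 50 ul aliquot used in the detection step.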

Tyrosine kinase activity is determined by transferring a 50 ul aliquot of the reaction
mixture to a microtiter plate (MTP) module and incubating at 37°C for 20 min. This
allows the streptavidin coated 96 well plate to associate with the biotinylated peptide.
Wash the MTP module with 300 ul/well of PBS four times. Next add 75 ul of
anti-phosphotyrosine antibody conjugated to horseradish peroxidase (anti-P-Tyr-POD
(0.5 u/ml)) to each well and incubate at 37°C for one hour. Wash the well as
above.

Next add 100 ul of peroxidase substrate solution (Boehringer Mannheim) and
incubate at room temperature for at least 5 min (up to 30 min). Measure the
absorbance of the sample at 405 nm by using an ELISA reader. The level of bound
peroxidase activity is quantitated using an ELISA reader and reflects the level of
tyrosine kinase activity.

Example 20: High-Throughput Screening Assay Identifying
Phosphorylation Activity

As a potential alternative and/or complement to the assay of protein tyrosine
kinase activity described in Example 19, an assay which detects activation
(phosphorylation) of major intracellular signal transduction intermediates can also be
used. For example, as described below one particular assay can detect tyrosine
phosphorylation of the Erk-1 and Erk-2 kinases. However, phosphorylation of other
molecules, such as Raf, JNK, p38 MAP, Map kinase kinase (MEK), MEK kinase,
Src, Muscle specific kinase (MuSK), IRAK, Tec, and Janus, as well as any other
phosphoserine, phosphotyrosine, or phosphothreonine molecule, can be detected by
substituting these molecules for Erk-1 or Erk-2 in the following assay.

Specifically, assay plates are made by coating the wells of a 96-well ELISA
plate with 0.1 ml of protein G (1 ug/ml) for 2 hr at room temp. (RT). The plates are then
rinsed with PBS and blocked with 3% BSA/PBS for 1 hr at RT. The protein G plates
are then treated with 2 commercial monoclonal antibodies (100 ng/well) against Erk-1
and Erk-2 (1 hr at RT) (Santa Cruz Biotechnology). (To detect other molecules, this
step can easily be modified by substituting a monoclonal antibody detecting any of the
above described molecules.) After 3-5 rinses with PBS, the plates are stored at 4°C
until use.

A431 cells are seeded at 20,000/well in a 96-well Loprodyne filterplate and
cultured overnight in growth medium. The cells are then starved for 48 hr in basal
medium (DMEM) and then treated with EGF (6 ng/well) or 50 ul of the supernatants
obtained in Example 11 for 5-20 minutes. The cells are then solubilized and extracts
filtered directly into the assay plate.

After incubation with the extract for 1 hr at RT, the wells are again rinsed. As a
positive control, a commercial preparation of MAP kinase (10 ng/well) is used in place
of A431 extract. Plates are then treated with a commercial polyclonal (rabbit) antibody
(1 ug/ml) which specifically recognizes the phosphorylated epitope of the Erk-1 and
Erk-2 kinases (1 hr at RT). This antibody is biotinylated by standard procedures. The
bound polyclonal antibody is then quantitated by successive incubations with
Europium-streptavidin and Europium fluorescence enhancing reagent in the Wallac
DELFIA instrument (time-resolved fluorescence). An increased fluorescent signal over
background indicates phosphorylation.

Example 21: Method of Determining Alterations in a Gene
Corresponding to a Polynucleotide

RNA is isolated from entire families or individual patients presenting with a
phenotype of interest (such as a disease). cDNA is then generated from
these RNA samples using protocols known in the art. (See, Sambrook.) The cDNA is
then used as a template for PCR, employing primers surrounding regions of interest in
SEQ ID NO:X. Suggested PCR conditions consist of 35 cycles at 95°C for 30
seconds; 60-120 seconds at 52-58°C; and 60-120 seconds at 70°C, using buffer
solutions described in Sidransky, D., et al., Science 252:706 (1991).
PCR products are then sequenced using primers labeled at their 5' end with T4
polynucleotide kinase, employing SequiTherm Polymerase (Epicentre Technologies).
The intron-exon borders of selected exons are also determined and genomic PCR
products analyzed to confirm the results. PCR products harboring suspected mutations
are then cloned and sequenced to validate the results of the direct sequencing.
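
The suggested PCR cycling conditions above can be captured as a small data structure, which also makes it easy to estimate how long the cycling takes. This is a sketch under stated assumptions: the step labels "denature", "anneal" and "extend" are conventional names not used in the text, and ramp times between temperatures are ignored.

```python
CYCLES = 35

# (label, (min temp C, max temp C), (min hold s, max hold s))
STEPS = [
    ("denature", (95, 95), (30, 30)),
    ("anneal", (52, 58), (60, 120)),
    ("extend", (70, 70), (60, 120)),
]


def cycling_time_range_min(steps=STEPS, cycles=CYCLES):
    """Return (shortest, longest) total hold time in minutes, excluding ramps."""
    shortest_s = sum(hold[0] for _, _, hold in steps) * cycles
    longest_s = sum(hold[1] for _, _, hold in steps) * cycles
    return shortest_s / 60, longest_s / 60


low, high = cycling_time_range_min()
print(f"{CYCLES} cycles take between {low} and {high} minutes of hold time")
```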

PCR products are cloned into T-tailed vectors as described in Holton, T.A. and
Graham, M.W., Nucleic Acids Research, 19:1156 (1991) and sequenced with T7
polymerase (United States Biochemical). Affected individuals are identified by
mutations not present in unaffected individuals.

Genomic rearrangements are also observed as a method of determining
alterations in a gene corresponding to a polynucleotide. Genomic clones isolated
according to Example 2 are nick-translated with digoxigenin-deoxyuridine
5'-triphosphate (Boehringer Mannheim), and FISH performed as described in Johnson,
Cv. et al., Methods Cell Biol. 35:73-99 (1991). Hybridization with the labeled probe is
carried out using a vast excess of human cot-1 DNA for specific hybridization to the
corresponding genomic locus.

Chromosomes are counterstained with 4',6-diamidino-2-phenylindole and
propidium iodide, producing a combination of G- and R-bands. Aligned images for
precise mapping are obtained using a triple-band filter set (Chroma Technology,
Brattleboro, VT) in combination with a cooled charge-coupled device camera
(Photometrics, Tucson, AZ) and variable excitation wavelength filters. (Johnson, Cv.
et al., Genet. Anal. Tech. Appl., 8:75 (1991).) Image collection, analysis and
chromosomal fractional length measurements are performed using the ISee Graphical
Program System. (Inovision Corporation, Durham, NC.) Chromosome alterations of
the genomic region hybridized by the probe are identified as insertions, deletions, and
translocations. These alterations are used as a diagnostic marker for an associated
disease.

Example 22: Method of Detecting Abnormal Levels of a Polypeptide in a
Biological Sample



A polypeptide of the present invention can be detected in a biological sample,
and if an increased or decreased level of the polypeptide is detected, this polypeptide is
a marker for a particular phenotype. Methods of detection are numerous, and thus, it is
understood that one skilled in the art can modify the following assay to fit their
particular needs.

For example, antibody-sandwich ELISAs are used to detect polypeptides in a
sample, preferably a biological sample. Wells of a microtiter plate are coated with
specific antibodies, at a final concentration of 0.2 to 10 ug/ml. The antibodies are either
monoclonal or polyclonal and are produced by the method described in Example 10.
The wells are blocked so that non-specific binding of the polypeptide to the well is
reduced.

The coated wells are then incubated for > 2 hours at RT with a sample
containing the polypeptide. Preferably, serial dilutions of the sample should be used to
validate results. The plates are then washed three times with deionized or distilled water
to remove unbound polypeptide.

Next, 50 ul of specific antibody-alkaline phosphatase conjugate, at a
concentration of 25-400 ng, is added and incubated for 2 hours at room temperature.
The plates are again washed three times with deionized or distilled water to remove
unbound conjugate.

Add 75 ul of 4-methylumbelliferyl phosphate (MUP) or p-nitrophenyl
phosphate (NPP) substrate solution to each well and incubate 1 hour at room
temperature. Measure the reaction by a microtiter plate reader. Prepare a standard
curve, using serial dilutions of a control sample, and plot polypeptide concentration on
the X-axis (log scale) and fluorescence or absorbance on the Y-axis (linear scale).
Interpolate the concentration of the polypeptide in the sample using the standard curve.
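
One way to carry out the interpolation described above is piecewise-linear interpolation between adjacent standards, with concentration on a log10 scale and signal on a linear scale. The sketch below is a minimal illustration, not the only valid curve-fitting approach; the function name and the example standards are hypothetical.

```python
import math


def interpolate_concentration(standards, signal):
    """Estimate concentration from a signal using a standard curve.

    standards: list of (concentration, signal) pairs from the control dilutions.
    Interpolates linearly in log10(concentration) between adjacent standards.
    """
    points = sorted((math.log10(conc), sig) for conc, sig in standards)
    for (x0, y0), (x1, y1) in zip(points, points[1:]):
        low, high = min(y0, y1), max(y0, y1)
        if y0 != y1 and low <= signal <= high:
            x = x0 + (signal - y0) * (x1 - x0) / (y1 - y0)
            return 10 ** x
    raise ValueError("signal lies outside the range of the standard curve")


# Hypothetical standards: (concentration in ng/ml, absorbance).
example_standards = [(1, 0.1), (10, 0.5), (100, 0.9)]
estimate = interpolate_concentration(example_standards, 0.7)
print(f"estimated concentration: {estimate:.1f} ng/ml")
```

Signals outside the range spanned by the standards raise an error rather than being extrapolated, which is consistent with the recommendation to test serial dilutions of the sample.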

Example 23: Formulating a Polypeptide

The secreted polypeptide composition will be formulated and dosed in a fashion
consistent with good medical practice, taking into account the clinical condition of the
individual patient (especially the side effects of treatment with the secreted polypeptide
alone), the site of delivery, the method of administration, the scheduling of
administration, and other factors known to practitioners. The "effective amount" for
purposes herein is thus determined by such considerations.

As a general proposition, the total pharmaceutically effective amount of secreted
polypeptide administered parenterally per dose will be in the range of about 1 ug/kg/day
to 10 mg/kg/day of patient body weight, although, as noted above, this will be subject
to therapeutic discretion. More preferably, this dose is at least 0.01 mg/kg/day, and
most preferably for humans between about 0.01 and 1 mg/kg/day for the hormone. If
given continuously, the secreted polypeptide is typically administered at a dose rate of
about 1 ug/kg/hour to about 50 ug/kg/hour, either by 1-4 injections per day or by
continuous subcutaneous infusions, for example, using a mini-pump. An intravenous
bag solution may also be employed. The length of treatment needed to observe changes
and the interval following treatment for responses to occur appear to vary depending
on the desired effect.

Pharmaceutical compositions containing the secreted protein of the invention are
administered orally, rectally, parenterally, intracisternally, intravaginally,
intraperitoneally, topically (as by powders, ointments, gels, drops or transdermal
patch), bucally, or as an oral or nasal spray. "Pharmaceutically acceptable carrier" refers
to a non-toxic solid, semisolid or liquid filler, diluent, encapsulating material or
formulation auxiliary of any type. The term "parenteral" as used herein refers to modes
of administration which include intravenous, intramuscular, intraperitoneal, intrasternal,
subcutaneous and intraarticular injection and infusion.



The secreted polypeptide is also suitably administered by sustained-release
systems. Suitable examples of sustained-release compositions include semi-permeable
polymer matrices in the form of shaped articles, e.g., films, or microcapsules.
Sustained-release matrices include polylactides (U.S. Pat. No. 3,773,919, EP 58,481),
copolymers of L-glutamic acid and gamma-ethyl-L-glutamate (Sidman, U. et al.,
Biopolymers 22:547-556 (1983)), poly (2-hydroxyethyl methacrylate) (R. Langer et
al., J. Biomed. Mater. Res. 15:167-277 (1981), and R. Langer, Chem. Tech. 12:98-105
(1982)), ethylene vinyl acetate (R. Langer et al.) or poly-D-(-)-3-hydroxybutyric
acid (EP 133,988). Sustained-release compositions also include liposomally entrapped
polypeptides. Liposomes containing the secreted polypeptide are prepared by methods
known per se: DE 3,218,121; Epstein et al., Proc. Natl. Acad. Sci. USA 82:3688-3692
(1985); Hwang et al., Proc. Natl. Acad. Sci. USA 77:4030-4034 (1980); EP 52,322;
EP 36,676; EP 88,046; EP 143,949; EP 142,641; Japanese Pat. Appl. 83-118008;
U.S. Pat. Nos. 4,485,045 and 4,544,545; and EP 102,324. Ordinarily, the liposomes
are of the small (about 200-800 Angstroms) unilamellar type in which the lipid content
is greater than about 30 mol. percent cholesterol, the selected proportion being adjusted
for the optimal secreted polypeptide therapy.

For parenteral administration, in one embodiment, the secreted polypeptide is
formulated generally by mixing it at the desired degree of purity, in a unit dosage
injectable form (solution, suspension, or emulsion), with a pharmaceutically acceptable
carrier, i.e., one that is non-toxic to recipients at the dosages and concentrations
employed and is compatible with other ingredients of the formulation. For example, the
formulation preferably does not include oxidizing agents and other compounds that are
known to be deleterious to polypeptides.

Generally, the formulations are prepared by contacting the polypeptide
uniformly and intimately with liquid carriers or finely divided solid carriers or both.
Then, if necessary, the product is shaped into the desired formulation. Preferably the
carrier is a parenteral carrier, more preferably a solution that is isotonic with the blood
of the recipient. Examples of such carrier vehicles include water, saline, Ringer's
solution, and dextrose solution. Non-aqueous vehicles such as fixed oils and ethyl
oleate are also useful herein, as well as liposomes.

The carrier suitably contains minor amounts of additives such as substances that
enhance isotonicity and chemical stability. Such materials are non-toxic to recipients at
the dosages and concentrations employed, and include buffers such as phosphate,
citrate, succinate, acetic acid, and other organic acids or their salts; antioxidants such as
ascorbic acid; low molecular weight (less than about ten residues) polypeptides, e.g.,
polyarginine or tripeptides; proteins, such as serum albumin, gelatin, or
immunoglobulins; hydrophilic polymers such as polyvinylpyrrolidone; amino acids,
such as glycine, glutamic acid, aspartic acid, or arginine; monosaccharides,
disaccharides, and other carbohydrates including cellulose or its derivatives, glucose,
mannose, or dextrins; chelating agents such as EDTA; sugar alcohols such as mannitol or
sorbitol; counterions such as sodium; and/or nonionic surfactants such as polysorbates,
poloxamers, or PEG.

The secreted polypeptide is typically formulated in such vehicles at a
concentration of about 0.1 mg/ml to 100 mg/ml, preferably 1-10 mg/ml, at a pH of
about 3 to 8. It will be understood that the use of certain of the foregoing excipients,
carriers, or stabilizers will result in the formation of polypeptide salts.

Any polypeptide to be used for therapeutic administration can be sterile. 
Sterility is readily accomplished by filtration through sterile filtration membranes (e.g., 
0.2 micron membranes). Therapeutic polypeptide compositions generally are placed 
into a container having a sterile access port, for example, an intravenous solution bag or 
vial having a stopper pierceable by a hypodermic injection needle. 

Polypeptides ordinarily will be stored in unit or multi-dose containers, for 
example, sealed ampoules or vials, as an aqueous solution or as a lyophilized 
formulation for reconstitution. As an example of a lyophilized formulation, 10-ml vials 
are filled with 5 ml of sterile-filtered 1% (w/v) aqueous polypeptide solution, and the
resulting mixture is lyophilized. The infusion solution is prepared by reconstituting the 
lyophilized polypeptide using bacteriostatic Water-for-Injection. 
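
The amount of polypeptide in the example lyophilized vial follows directly from the fill volume and the w/v percentage, since 1% (w/v) corresponds to 10 mg/ml. The sketch below works through that arithmetic; the function names and the 5 ml reconstitution volume are hypothetical and chosen only for illustration.

```python
def polypeptide_mg(percent_w_v, fill_volume_ml):
    """Mass (mg) of polypeptide in a vial; 1% (w/v) equals 10 mg/ml."""
    return percent_w_v * 10.0 * fill_volume_ml


def reconstituted_mg_per_ml(mass_mg, reconstitution_volume_ml):
    """Concentration after reconstituting the lyophilized cake."""
    return mass_mg / reconstitution_volume_ml


vial_mg = polypeptide_mg(1.0, 5.0)
# Hypothetical reconstitution in 5 ml of bacteriostatic Water-for-Injection.
conc = reconstituted_mg_per_ml(vial_mg, 5.0)
print(f"{vial_mg} mg per vial, {conc} mg/ml after reconstitution")
```

Under these assumptions each vial holds 50 mg of polypeptide, and reconstitution in 5 ml gives 10 mg/ml, at the upper end of the preferred 1-10 mg/ml range.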

The invention also provides a pharmaceutical pack or kit comprising one or
more containers filled with one or more of the ingredients of the pharmaceutical
compositions of the invention. Associated with such container(s) can be a notice in the
form prescribed by a governmental agency regulating the manufacture, use or sale of
pharmaceuticals or biological products, which notice reflects approval by the agency of
manufacture, use or sale for human administration. In addition, the polypeptides of the
present invention may be employed in conjunction with other therapeutic compounds.

Example 24: Method of Treating Decreased Levels of the Polypeptide

It will be appreciated that conditions caused by a decrease in the standard or 
normal expression level of a secreted protein in an individual can be treated by 
administering the polypeptide of the present invention, preferably in the secreted form. 
Thus, the invention also provides a method of treatment of an individual in need of an 
increased level of the polypeptide comprising administering to such an individual a 
pharmaceutical composition comprising an amount of the polypeptide to increase the 
activity level of the polypeptide in such an individual. 

For example, a patient with decreased levels of a polypeptide receives a daily
dose of 0.1-100 ug/kg of the polypeptide for six consecutive days. Preferably, the
polypeptide is in the secreted form. The exact details of the dosing scheme, based on 
administration and formulation, are provided in Example 23. 

Example 25: Method of Treating Increased Levels of the Polypeptide 

Antisense technology is used to inhibit production of a polypeptide of the 
present invention. This technology is one example of a method of decreasing levels of 
a polypeptide, preferably a secreted form, due to a variety of etiologies, such as cancer. 

For example, a patient diagnosed with abnormally increased levels of a 
polypeptide is administered intravenously antisense polynucleotides at 0.5, 1.0, 1.5, 
2.0 and 3.0 mg/kg day for 21 days. This treatment is repeated after a 7-day rest period 
if the treatment was well tolerated. The formulation of the antisense polynucleotide is 
provided in Example 23. 

Example 26: Method of Treatment Using Gene Therapy

One method of gene therapy transplants fibroblasts, which are capable of
expressing a polypeptide, onto a patient. Generally, fibroblasts are obtained from a
subject by skin biopsy. The resulting tissue is placed in tissue-culture medium and
separated into small pieces. Small chunks of the tissue are placed on a wet surface of a
tissue culture flask; approximately ten pieces are placed in each flask. The flask is
turned upside down, closed tight and left at room temperature overnight. After 24
hours at room temperature, the flask is inverted, the chunks of tissue remain fixed to
the bottom of the flask, and fresh media (e.g., Ham's F12 media, with 10% FBS,
penicillin and streptomycin) is added. The flasks are then incubated at 37°C for
approximately one week.

At this time, fresh media is added and subsequently changed every several days.
After an additional two weeks in culture, a monolayer of fibroblasts emerges. The
monolayer is trypsinized and scaled into larger flasks.

pMV-7 (Kirschmeier, P.T. et al., DNA, 7:219-25 (1988)), flanked by the long
terminal repeats of the Moloney murine sarcoma virus, is digested with EcoRI and
HindIII and subsequently treated with calf intestinal phosphatase. The linear vector is
fractionated on agarose gel and purified, using glass beads.

The cDNA encoding a polypeptide of the present invention can be amplified 
using PCR primers which correspond to the 5' and 3' end sequences respectively as set 
forth in Example 1. Preferably, the 5' primer contains an EcoRI site and the 3' primer 
includes a HindIII site. Equal quantities of the Moloney murine sarcoma virus linear 
backbone and the amplified EcoRI and HindIII fragment are added together, in the 
presence of T4 DNA ligase. The resulting mixture is maintained under conditions 
appropriate for ligation of the two fragments. The ligation mixture is then used to 
transform bacteria HB101, which are then plated onto agar containing kanamycin for 
the purpose of confirming that the vector has the gene of interest properly inserted. 
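
By way of illustration only, the presence of the intended restriction sites in a candidate primer pair can be checked in software before the primers are synthesized. The following is a minimal sketch in Python. The primer sequences and the function names are hypothetical and are not the primers of Example 1; the recognition sequences GAATTC (EcoRI) and AAGCTT (HindIII) are the standard sites for these two enzymes.

```python
# Recognition sequences for the two enzymes named in the text.
# Both sites are palindromic, so checking one strand is sufficient.
RESTRICTION_SITES = {"EcoRI": "GAATTC", "HindIII": "AAGCTT"}


def has_site(primer: str, enzyme: str) -> bool:
    """Return True if the primer contains the enzyme's recognition sequence."""
    return RESTRICTION_SITES[enzyme] in primer.upper()


def check_primer_pair(forward: str, reverse: str) -> bool:
    """Return True if the 5' primer carries an EcoRI site and the 3' primer a HindIII site."""
    return has_site(forward, "EcoRI") and has_site(reverse, "HindIII")


# Hypothetical primers, for illustration only (not taken from Example 1).
forward_primer = "GCGGAATTCATGGCTAGCTAGGCT"
reverse_primer = "GCGAAGCTTTCAGGATCCGATCGA"

if __name__ == "__main__":
    print(check_primer_pair(forward_primer, reverse_primer))
```

A check of this kind confirms only that each primer carries its intended site. It does not confirm that the amplified insert is free of internal EcoRI or HindIII sites, which would need to be verified separately.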

The amphotropic pA317 or GP+am12 packaging cells are grown in tissue 
culture to confluent density in Dulbecco's Modified Eagle's Medium (DMEM) with 10% 
calf serum (CS), penicillin and streptomycin. The MSV vector containing the gene is 
then added to the media and the packaging cells transduced with the vector. The 
packaging cells now produce infectious viral particles containing the gene (the 
packaging cells are now referred to as producer cells). 

Fresh media is added to the transduced producer cells, and subsequently, the 
media is harvested from a 10 cm plate of confluent producer cells. The spent media, 
containing the infectious viral particles, is filtered through a millipore filter to remove 
detached producer cells and this media is then used to infect fibroblast cells. Media is 
removed from a sub-confluent plate of fibroblasts and quickly replaced with the media 
from the producer cells. This media is removed and replaced with fresh media. If the 
titer of virus is high, then virtually all fibroblasts will be infected and no selection is 
required. If the titer is very low, then it is necessary to use a retroviral vector that has a 
selectable marker, such as neo or his. Once the fibroblasts have been efficiently 
infected, the fibroblasts are analyzed to determine whether protein is produced. 
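
As a rough guide to how titer relates to the fraction of fibroblasts infected, one common modelling assumption (not stated in this example) is that the number of infectious particles entering each cell follows a Poisson distribution whose mean is the multiplicity of infection (MOI), i.e., infectious particles per cell. Under that assumption the expected infected fraction is 1 - exp(-MOI). The sketch below is illustrative only; the function names and the 0.95 cut-off for "virtually all" are hypothetical choices.

```python
import math


def fraction_infected(moi: float) -> float:
    """Expected fraction of cells receiving at least one particle (Poisson assumption)."""
    if moi < 0:
        raise ValueError("MOI must be non-negative")
    return 1.0 - math.exp(-moi)


def needs_selection(moi: float, threshold: float = 0.95) -> bool:
    """Return True if the expected infected fraction falls below the chosen threshold.

    The threshold is an illustrative assumption, not a value from the text.
    """
    return fraction_infected(moi) < threshold


if __name__ == "__main__":
    for moi in (0.1, 1.0, 5.0):
        print(moi, round(fraction_infected(moi), 3), needs_selection(moi))
```

Under this model a low MOI leaves most cells uninfected, which is the situation in which a vector carrying a selectable marker is called for.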

The engineered fibroblasts are then transplanted onto the host, either alone or 
after having been grown to confluence on cytodex 3 microcarrier beads. 







WO 99/46289 PCT/US99/05721 


Example 27: Method of Treatment Using Gene Therapy - In Vivo 

Another aspect of the present invention is using in vivo gene therapy methods 
to treat disorders, diseases and conditions. The gene therapy method relates to the 
introduction of naked nucleic acid (DNA, RNA, and antisense DNA or RNA) 
sequences into an animal to increase or decrease the expression of the polypeptide. 
The polynucleotide of the present invention may be operatively linked to a promoter 
or any other genetic elements necessary for the expression of the polypeptide by the 
target tissue. Such gene therapy and delivery techniques and methods are known in the 
art; see, for example, WO90/11092, WO98/11779; U.S. Patent Nos. 5,693,622, 
5,705,151, 5,580,859; Tabata H. et al. (1997) Cardiovasc. Res. 35(3):470-479; Chao J. et 
al. (1997) Pharmacol. Res. 35(6):517-522; Wolff J.A. (1997) Neuromuscul. Disord. 
7(5):314-318; Schwartz B. et al. (1996) Gene Ther. 3(5):405-411; Tsurumi Y. et al. 
(1996) Circulation 94(12):3281-3290 (incorporated herein by reference). 

The polynucleotide constructs may be delivered by any method that delivers 
injectable materials to the cells of an animal, such as injection into the interstitial 
space of tissues (heart, muscle, skin, lung, liver, intestine and the like). The 
polynucleotide constructs can be delivered in a pharmaceutically acceptable liquid or 
aqueous carrier. 

The term "naked" polynucleotide, DNA or RNA, refers to sequences that are 
free from any delivery vehicle that acts to assist, promote, or facilitate entry into the 
cell, including viral sequences, viral particles, liposome formulations, lipofectin or 
precipitating agents and the like. However, the polynucleotides of the present 
invention may also be delivered in liposome formulations (such as those taught in 
Felgner P.L. et al. (1995) Ann. NY Acad. Sci. 772:126-139 and Abdallah B. et al. 
(1995) Biol. Cell 85(1):1-7) which can be prepared by methods well known to those 
skilled in the art. 

The polynucleotide vector constructs used in the gene therapy method are 
preferably constructs that will not integrate into the host genome nor will they contain 
sequences that allow for replication. Any strong promoter known to those skilled in 
the art can be used for driving the expression of DNA. Unlike other gene therapy 
techniques, one major advantage of introducing naked nucleic acid sequences into 
target cells is the transitory nature of the polynucleotide synthesis in the cells. Studies 
have shown that non-replicating DNA sequences can be introduced into cells to 
provide production of the desired polypeptide for periods of up to six months. 

The polynucleotide construct can be delivered to the interstitial space of 
tissues within an animal, including muscle, skin, brain, lung, liver, spleen, bone 
marrow, thymus, heart, lymph, blood, bone, cartilage, pancreas, kidney, gall bladder, 
stomach, intestine, testis, ovary, uterus, rectum, nervous system, eye, gland, and 
connective tissue. Interstitial space of the tissues comprises the intercellular fluid, 
mucopolysaccharide matrix among the reticular fibers of organ tissues, elastic fibers in 
the walls of vessels or chambers, collagen fibers of fibrous tissues, or that same matrix 
within connective tissue ensheathing muscle cells or in the lacunae of bone. It is 
similarly the space occupied by the plasma of the circulation and the lymph fluid of 
the lymphatic channels. Delivery to the interstitial space of muscle tissue is preferred 
for the reasons discussed below. The polynucleotide constructs may be conveniently 
delivered by injection into the tissues comprising these cells. They are preferably 
delivered to and expressed in persistent, non-dividing cells which are differentiated, 
although delivery and expression may be achieved in non-differentiated or less 
completely differentiated cells, such as, for example, stem cells of blood or skin 
fibroblasts. In vivo muscle cells are particularly competent in their ability to take up 
and express polynucleotides. 

For the naked polynucleotide injection, an effective dosage amount of DNA or 
RNA will be in the range of from about 0.05 µg/kg body weight to about 50 mg/kg body 
weight. Preferably the dosage will be from about 0.005 mg/kg to about 20 mg/kg and 
more preferably from about 0.05 mg/kg to about 5 mg/kg. Of course, as the artisan of 
ordinary skill will appreciate, this dosage will vary according to the tissue site of 
injection. The appropriate and effective dosage of nucleic acid sequence can readily be 
determined by those of ordinary skill in the art and may depend on the condition being 
treated and the route of administration. The preferred route of administration is by 
the parenteral route of injection into the interstitial space of tissues. However, other 
parenteral routes may also be used, such as inhalation of an aerosol formulation, 
particularly for delivery to lungs or bronchial tissues, throat or mucous membranes of 
the nose. In addition, naked polynucleotide constructs can be delivered to arteries 
during angioplasty by the catheter used in the procedure. 
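
The preferred and more preferred dosage ranges stated above are expressed per kilogram of body weight, and converting them to an absolute amount for a given subject is simple arithmetic. The following Python sketch illustrates the conversion only. The function and variable names are hypothetical, the body weights used are illustrative assumptions, and the result is not a dosing recommendation.

```python
# Dosage ranges in mg per kg body weight, as stated in the text.
DOSE_RANGES_MG_PER_KG = {
    "preferred": (0.005, 20.0),
    "more_preferred": (0.05, 5.0),
}


def dose_window_mg(body_weight_kg: float, tier: str = "more_preferred") -> tuple[float, float]:
    """Return the (low, high) absolute amount in mg for a given body weight."""
    if body_weight_kg <= 0:
        raise ValueError("body weight must be positive")
    low, high = DOSE_RANGES_MG_PER_KG[tier]
    return (low * body_weight_kg, high * body_weight_kg)


if __name__ == "__main__":
    # Illustrative assumption: a 20 g (0.02 kg) mouse.
    print(dose_window_mg(0.02))
    # Illustrative assumption: a 70 kg subject, preferred range.
    print(dose_window_mg(70, "preferred"))
```

As the text notes, the appropriate dosage also depends on the tissue site of injection, the condition being treated and the route of administration, none of which this arithmetic captures.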

The dose response effects of injected polynucleotide in muscle in vivo are 
determined as follows. Suitable template DNA for production of mRNA coding for 
polypeptide of the present invention is prepared in accordance with a standard 
recombinant DNA methodology. The template DNA, which may be either circular or 
linear, is either used as naked DNA or complexed with liposomes. The quadriceps 
muscles of mice are then injected with various amounts of the template DNA. 

Five to six week old female and male Balb/C mice are anesthetized by 
intraperitoneal injection with 0.3 ml of 2.5% Avertin. A 1.5 cm incision is made on 
the anterior thigh, and the quadriceps muscle is directly visualized. The template 
DNA is injected in 0.1 ml of carrier in a 1 cc syringe through a 27 gauge needle over 
one minute, approximately 0.5 cm from the distal insertion site of the muscle into the 
knee and about 0.2 cm deep. A suture is placed over the injection site for future 
localization, and the skin is closed with stainless steel clips. 

After an appropriate incubation time (e.g., 7 days) muscle extracts are 
prepared by excising the entire quadriceps. Every fifth 15 µm cross-section of the 
individual quadriceps muscles is histochemically stained for protein expression. A 
time course for protein expression may be done in a similar fashion except that 
quadriceps from different mice are harvested at different times. Persistence of DNA 
in muscle following injection may be determined by Southern blot analysis after 
preparing total cellular DNA and Hirt supernatants from injected and control mice. 
The results of the above experimentation in mice can be used to extrapolate proper 
dosages and other treatment parameters in humans and other animals using naked 
DNA. 

Example 28: Transgenic Animals. 

The polypeptides of the invention can also be expressed in transgenic animals. 
Animals of any species, including, but not limited to, mice, rats, rabbits, hamsters, 
guinea pigs, pigs, micro-pigs, goats, sheep, cows and non-human primates, e.g., 
baboons, monkeys, and chimpanzees may be used to generate transgenic animals. In a 
specific embodiment, techniques described herein or otherwise known in the art, are 
used to express polypeptides of the invention in humans, as part of a gene therapy 
protocol. 

Any technique known in the art may be used to introduce the transgene (i.e., 
polynucleotides of the invention) into animals to produce the founder lines of 
transgenic animals. Such techniques include, but are not limited to, pronuclear 
microinjection (Paterson et al., Appl. Microbiol. Biotechnol. 40:691-698 (1994); 
Carver et al., Biotechnology (NY) 11:1263-1270 (1993); Wright et al., Biotechnology 
(NY) 9:830-834 (1991); and Hoppe et al., U.S. Pat. No. 4,873,191 (1989)); retrovirus 
mediated gene transfer into germ lines (Van der Putten et al., Proc. Natl. Acad. Sci. 
USA 82:6148-6152 (1985)), blastocysts or embryos; gene targeting in embryonic stem 
cells (Thompson et al., Cell 56:313-321 (1989)); electroporation of cells or embryos 
(Lo, Mol. Cell. Biol. 3:1803-1814 (1983)); introduction of the polynucleotides 
of the invention using a gene gun (see, e.g., Ulmer et al., Science 259:1745 (1993)); 
introducing nucleic acid constructs into embryonic pluripotent stem cells and 
transferring the stem cells back into the blastocyst; and sperm-mediated gene transfer 
(Lavitrano et al., Cell 57:717-723 (1989)); etc. For a review of such techniques, see 
Gordon, "Transgenic Animals," Intl. Rev. Cytol. 115:171-229 (1989), which is 
incorporated by reference herein in its entirety. 

Any technique known in the art may be used to produce transgenic clones 
containing polynucleotides of the invention, for example, nuclear transfer into 
enucleated oocytes of nuclei from cultured embryonic, fetal, or adult cells induced to 
quiescence (Campbell et al., Nature 380:64-66 (1996); Wilmut et al., Nature 385:810-813 
(1997)). 



The present invention provides for transgenic animals that carry the transgene 
in all their cells, as well as animals which carry the transgene in some, but not all their 
cells, i.e., mosaic or chimeric animals. The transgene may be integrated as a single 
transgene or as multiple copies such as in concatamers, e.g., head-to-head tandems or 
head-to-tail tandems. The transgene may also be selectively introduced into and 
activated in a particular cell type by following, for example, the teaching of Lasko et 
al. (Lasko et al., Proc. Natl. Acad. Sci. USA 89:6232-6236 (1992)). The regulatory 
sequences required for such a cell-type specific activation will depend upon the 
particular cell type of interest, and will be apparent to those of skill in the art. When 
it is desired that the polynucleotide transgene be integrated into the chromosomal site 
of the endogenous gene, gene targeting is preferred. Briefly, when such a technique is 
to be utilized, vectors containing some nucleotide sequences homologous to the 
endogenous gene are designed for the purpose of integrating, via homologous 
recombination with chromosomal sequences, into and disrupting the function of the 
nucleotide sequence of the endogenous gene. The transgene may also be selectively 
introduced into a particular cell type, thus inactivating the endogenous gene in only 
that cell type, by following, for example, the teaching of Gu et al. (Gu et al., Science 
265:103-106 (1994)). The regulatory sequences required for such a cell-type specific 
inactivation will depend upon the particular cell type of interest, and will be apparent 
to those of skill in the art. 

Once transgenic animals have been generated, the expression of the 
recombinant gene may be assayed utilizing standard techniques. Initial screening may 
be accomplished by Southern blot analysis or PCR techniques to analyze animal 
tissues to verify that integration of the transgene has taken place. The level of mRNA 
expression of the transgene in the tissues of the transgenic animals may also be 
assessed using techniques which include, but are not limited to, Northern blot analysis 
of tissue samples obtained from the animal, in situ hybridization analysis, and reverse 
transcriptase-PCR (RT-PCR). Samples of transgenic gene-expressing tissue may also 
be evaluated immunocytochemically or immunohistochemically using antibodies 
specific for the transgene product. 

Once the founder animals are produced, they may be bred, inbred, outbred, or 
crossbred to produce colonies of the particular animal. Examples of such breeding 
strategies include, but are not limited to: outbreeding of founder animals with more 
than one integration site in order to establish separate lines; inbreeding of separate 
lines in order to produce compound transgenics that express the transgene at higher 
levels because of the effects of additive expression of each transgene; crossing of 
heterozygous transgenic animals to produce animals homozygous for a given 
integration site in order to both augment expression and eliminate the need for 
screening of animals by DNA analysis; crossing of separate homozygous lines to 
produce compound heterozygous or homozygous lines; and breeding to place the 
transgene on a distinct background that is appropriate for an experimental model of 
interest. 
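
The expected outcome of crossing heterozygous transgenic animals can be worked out with a simple Punnett-square calculation. The Python sketch below assumes, for illustration, that a given integration site segregates as a single Mendelian locus with two alleles, written here as "T" (transgene present) and "+" (no transgene); this notation and the function name are hypothetical.

```python
from collections import Counter
from fractions import Fraction
from itertools import product


def cross(parent_a: str, parent_b: str) -> dict[str, Fraction]:
    """Return offspring genotype frequencies for a single-locus cross.

    Each parent is a two-character string of alleles, e.g. "T+" for a
    heterozygote. Genotypes are written with "T" first ("TT", "T+", "++").
    """
    counts = Counter(
        "".join(sorted(allele_a + allele_b, reverse=True))
        for allele_a, allele_b in product(parent_a, parent_b)
    )
    total = sum(counts.values())
    return {genotype: Fraction(n, total) for genotype, n in counts.items()}


if __name__ == "__main__":
    # Heterozygote x heterozygote: one quarter of offspring are expected
    # to be homozygous for the integration site.
    print(cross("T+", "T+"))
```

Under this assumption, a heterozygote-by-heterozygote cross is expected to yield one quarter homozygous offspring, while a cross of two animals homozygous for the same site yields only homozygous offspring, which is why such lines no longer require screening by DNA analysis.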

Transgenic animals of the invention have uses which include, but are not 
limited to, animal model systems useful in elaborating the biological function of 
polypeptides of the present invention, studying conditions and/or disorders 
associated with aberrant expression, and in screening for compounds effective in 
ameliorating such conditions and/or disorders. 

Example 29: Knock-Out Animals. 

Endogenous gene expression can also be reduced by inactivating or "knocking 
out" the gene and/or its promoter using targeted homologous recombination. (E.g., see 
Smithies et al., Nature 317:230-234 (1985); Thomas & Capecchi, Cell 51:503-512 
(1987); Thompson et al., Cell 56:313-321 (1989); each of which is incorporated by 
reference herein in its entirety). For example, a mutant, non-functional polynucleotide 
of the invention (or a completely unrelated DNA sequence) flanked by DNA 
homologous to the endogenous polynucleotide sequence (either the coding regions or 
regulatory regions of the gene) can be used, with or without a selectable marker and/or 
a negative selectable marker, to transfect cells that express polypeptides of the 
invention in vivo. In another embodiment, techniques known in the art are used to 
generate knockouts in cells that contain, but do not express, the gene of interest. 
Insertion of the DNA construct, via targeted homologous recombination, results in 
inactivation of the targeted gene. Such approaches are particularly suited in research 
and agricultural fields where modifications to embryonic stem cells can be used to 
generate animal offspring with an inactive targeted gene (e.g., see Thomas & Capecchi 
1987 and Thompson 1989, supra). However, this approach can be routinely adapted 
for use in humans provided the recombinant DNA constructs are directly administered 
or targeted to the required site in vivo using appropriate viral vectors that will be 
apparent to those of skill in the art. 

In further embodiments of the invention, cells that are genetically engineered to 
express the polypeptides of the invention, or alternatively, that are genetically 
engineered not to express the polypeptides of the invention (e.g., knockouts) are 
administered to a patient in vivo. Such cells may be obtained from the patient (i.e., 
animal, including human) or an MHC compatible donor and can include, but are not 
limited to, fibroblasts, bone marrow cells, blood cells (e.g., lymphocytes), adipocytes, 
muscle cells, endothelial cells, etc. The cells are genetically engineered in vitro using 
recombinant DNA techniques to introduce the coding sequence of polypeptides of the 
invention into the cells, or alternatively, to disrupt the coding sequence and/or 
endogenous regulatory sequence associated with the polypeptides of the invention, 
by transduction (using viral vectors, and preferably vectors that integrate the 
transgene into the cell genome) or transfection procedures, including, but not limited 
to, the use of plasmids, cosmids, YACs, naked DNA, electroporation, liposomes, etc. 
The coding sequence of the polypeptides of the invention can be placed under the 
control of a strong constitutive or inducible promoter or promoter/enhancer to achieve 
expression, and preferably secretion, of the polypeptides of the invention. The 
engineered cells which express and preferably secrete the polypeptides of the 
invention can be introduced into the patient systemically, e.g., in the circulation, or 
intraperitoneally. 

Alternatively, the cells can be incorporated into a matrix and implanted in the 
body, e.g., genetically engineered fibroblasts can be implanted as part of a skin graft; 
genetically engineered endothelial cells can be implanted as part of a lymphatic or 
vascular graft. (See, for example, Anderson et al., U.S. Patent No. 5,399,349; and 
Mulligan & Wilson, U.S. Patent No. 5,460,959, each of which is incorporated by 
reference herein in its entirety). 

When the cells to be administered are non-autologous or non-MHC compatible 
cells, they can be administered using well known techniques which prevent the 
development of a host immune response against the introduced cells. For example, the 
cells may be introduced in an encapsulated form which, while allowing for an exchange 
of components with the immediate extracellular environment, does not allow the 
introduced cells to be recognized by the host immune system. 

Transgenic and "knock-out" animals of the invention have uses which include, 
but are not limited to, animal model systems useful in elaborating the biological 
function of polypeptides of the present invention, studying conditions and/or 
disorders associated with aberrant expression, and in screening for compounds 
effective in ameliorating such conditions and/or disorders. 


It will be clear that the invention may be practiced otherwise than as particularly 
described in the foregoing description and examples. Numerous modifications and 
variations of the present invention are possible in light of the above teachings and, 
therefore, are within the scope of the appended claims. 

The entire disclosure of each document cited (including patents, patent 
applications, journal articles, abstracts, laboratory manuals, books, or other 
disclosures) in the Background of the Invention, Detailed Description, and Examples is 
hereby incorporated herein by reference. Further, the hard copy of the sequence listing 
submitted herewith and the corresponding computer readable form are both 
incorporated herein by reference in their entireties. 

INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 
(PCT Rule 13bis) 

A. The indications made below relate to the microorganism referred to in the description 
on page [illegible], line N/A 

B. IDENTIFICATION OF DEPOSIT 
Further deposits are identified on an additional sheet [ ] 

Name of depositary institution: American Type Culture Collection 

Address of depositary institution (including postal code and country): 
10801 University Boulevard 
Manassas, Virginia 20110-2209 
United States of America 

Date of deposit: February 12, 1998 

Accession Number: 209627 

C. ADDITIONAL INDICATIONS (leave blank if not applicable) 
This information is continued on an additional sheet [ ] 

D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States) 

E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable) 
The indications listed below will be submitted to the International Bureau later (specify the general nature of the indications, e.g., "Accession 
Number of Deposit") 

For receiving Office use only 
This sheet was received with the international application 
Authorized officer: [signature illegible], Paralegal Specialist, APD-PCT Operations 

For International Bureau use only 
[ ] This sheet was received by the International Bureau on: 
Authorized officer 

Form PCT/RO/134 

What Is Claimed Is: 

1. An isolated nucleic acid molecule comprising a polynucleotide having a 
nucleotide sequence at least 95% identical to a sequence selected from the group 
consisting of: 

(a) a polynucleotide fragment of SEQ ID NO:X or a polynucleotide fragment of 
the cDNA sequence included in ATCC Deposit No:Z, which is hybridizable to SEQ ID 
NO:X; 

(b) a polynucleotide encoding a polypeptide fragment of SEQ ID NO:Y or a 
polypeptide fragment encoded by the cDNA sequence included in ATCC Deposit No:Z, 
which is hybridizable to SEQ ID NO:X; 

(c) a polynucleotide encoding a polypeptide domain of SEQ ID NO:Y or a 
polypeptide domain encoded by the cDNA sequence included in ATCC Deposit No:Z, 
which is hybridizable to SEQ ID NO:X; 

(d) a polynucleotide encoding a polypeptide epitope of SEQ ID NO:Y or a 
polypeptide epitope encoded by the cDNA sequence included in ATCC Deposit No:Z, 
which is hybridizable to SEQ ID NO:X; 

(e) a polynucleotide encoding a polypeptide of SEQ ID NO:Y or the cDNA 
sequence included in ATCC Deposit No:Z, which is hybridizable to SEQ ID NO:X, 
having biological activity; 

(f) a polynucleotide which is a variant of SEQ ID NO:X; 

(g) a polynucleotide which is an allelic variant of SEQ ID NO:X; 

(h) a polynucleotide which encodes a species homologue of the SEQ ID NO:Y; 

(i) a polynucleotide capable of hybridizing under stringent conditions to any 
one of the polynucleotides specified in (a)-(h), wherein said polynucleotide does not 
hybridize under stringent conditions to a nucleic acid molecule having a nucleotide 
sequence of only A residues or of only T residues. 



2. The isolated nucleic acid molecule of claim 1, wherein the 
polynucleotide fragment comprises a nucleotide sequence encoding a secreted protein. 

3. The isolated nucleic acid molecule of claim 1, wherein the 
polynucleotide fragment comprises a nucleotide sequence encoding the sequence 
identified as SEQ ID NO:Y or the polypeptide encoded by the cDNA sequence included 
in ATCC Deposit No:Z, which is hybridizable to SEQ ID NO:X. 

4. The isolated nucleic acid molecule of claim 1, wherein the 
polynucleotide fragment comprises the entire nucleotide sequence of SEQ ID NO:X or 
the cDNA sequence included in ATCC Deposit No:Z, which is hybridizable to SEQ ID 
NO:X. 

5. The isolated nucleic acid molecule of claim 2, wherein the nucleotide 
sequence comprises sequential nucleotide deletions from either the C-terminus or the 
N-terminus. 

6. The isolated nucleic acid molecule of claim 3, wherein the nucleotide 
sequence comprises sequential nucleotide deletions from either the C-terminus or the 
N-terminus. 

7. A recombinant vector comprising the isolated nucleic acid molecule of 
claim 1. 

8. A method of making a recombinant host cell comprising the isolated 
nucleic acid molecule of claim 1. 

9. A recombinant host cell produced by the method of claim 8. 

10. The recombinant host cell of claim 9 comprising vector sequences. 

11. An isolated polypeptide comprising an amino acid sequence at least 95% 
identical to a sequence selected from the group consisting of: 

(a) a polypeptide fragment of SEQ ID NO:Y or the encoded sequence included 
in ATCC Deposit No:Z; 

(b) a polypeptide fragment of SEQ ID NO:Y or the encoded sequence included 
in ATCC Deposit No:Z, having biological activity; 

(c) a polypeptide domain of SEQ ID NO:Y or the encoded sequence included in 
ATCC Deposit No:Z; 

(d) a polypeptide epitope of SEQ ID NO:Y or the encoded sequence included in 
ATCC Deposit No:Z; 

(e) a secreted form of SEQ ID NO:Y or the encoded sequence included in 
ATCC Deposit No:Z; 

(f) a full length protein of SEQ ID NO:Y or the encoded sequence included in 
ATCC Deposit No:Z; 

(g) a variant of SEQ ID NO:Y; 

(h) an allelic variant of SEQ ID NO:Y; or 

(i) a species homologue of the SEQ ID NO:Y. 

12. The isolated polypeptide of claim 11, wherein the secreted form or the 
full length protein comprises sequential amino acid deletions from either the C-terminus 
or the N-terminus. 

13. An isolated antibody that binds specifically to the isolated polypeptide of 
claim 11. 

14. A recombinant host cell that expresses the isolated polypeptide of claim 
11. 

15. A method of making an isolated polypeptide comprising: 

(a) culturing the recombinant host cell of claim 14 under conditions such that 
said polypeptide is expressed; and 

(b) recovering said polypeptide. 

16. The polypeptide produced by claim 15. 

17. A method for preventing, treating, or ameliorating a medical condition, 
comprising administering to a mammalian subject a therapeutically effective amount of 
the polypeptide of claim 11 or the polynucleotide of claim 1. 

18. A method of diagnosing a pathological condition or a susceptibility to a 
pathological condition in a subject comprising: 

(a) determining the presence or absence of a mutation in the polynucleotide of 
claim 1; and 

(b) diagnosing a pathological condition or a susceptibility to a pathological 
condition based on the presence or absence of said mutation. 

19. A method of diagnosing a pathological condition or a susceptibility to a 
pathological condition in a subject comprising: 

(a) determining the presence or amount of expression of the polypeptide of 
claim 11 in a biological sample; and 

(b) diagnosing a pathological condition or a susceptibility to a pathological 
condition based on the presence or amount of expression of the polypeptide. 

20. A method for identifying a binding partner to the polypeptide of claim 11 
comprising: 

(a) contacting the polypeptide of claim 11 with a binding partner; and 

(b) determining whether the binding partner effects an activity of the 
polypeptide. 

21. The gene corresponding to the cDNA sequence of SEQ ID NO:Y. 

22. A method of identifying an activity in a biological assay, wherein the 
method comprises: 

(a) expressing SEQ ID NO:X in a cell; 

(b) isolating the supernatant; 

(c) detecting an activity in a biological assay; and 

(d) identifying the protein in the supernatant having the activity. 

23. The product produced by the method of claim 20. 

<110> Human Genome Sciences, Inc. et al. 

<120> 31 Human secreted proteins 

<130> PZ026PCT 

<140> Unassigned 
<141> 1999-03-11 

<150> 60/077,714 
<151> 1998-03-12 

<150> 60/077,687 
<151> 1998-03-12 

<150> 60/077,686 
<151> 1998-03-12 

<150> 60/077,696 
<151> 1998-03-12 

<160> 176 

<170> PatentIn Ver. 2.0 



<210> 1 
<211> 733 
<212> DNA 

<213> Homo sapiens 



<400> 1 

gggatccgga gcccaaatct tctgacaaaa ctcacacatg cccaccgtgc ccagcacctg 60 
aattcgaggg tgcaccgtca gtcttcctct tccccccaaa acccaaggac accctcatga 120 
tctcccggac tcctgaggtc acatgcgtgg tggtggacgt aagccacgaa gaccctgagg 180 
tcaagttcaa ctggtacgtg gacggcgtgg aggtgcataa tgccaagaca aagccgcggg 240 
aggagcagta caacagcacg taccgtgtgg tcagcgtcct caccgtcctg caccaggact 300 
ggctgaatgg caaggagtac aagtgcaagg tctccaacaa agccctccca acccccatcg 360 
agaaaaccat ctccaaagcc aaagggcagc cccgagaacc acaggtgtac accctgcccc 420 
catcccggga tgagctgacc aagaaccagg tcagcctgac ctgcctggtc aaaggcttct 480 
atccaagcga catcgccgtg gagtgggaga gcaatgggca gccggagaac aactacaaga 540 
ccacgcctcc cgtgctggac tccgacggct ccttcttcct ctacagcaag ctcaccgtgg 600 
acaagagcag gtggcagcag gggaacgtct tctcatgctc cgtgatgcat gaggctctgc 660 
acaaccacta cacgcagaag agcctctccc tgtctccggg taaatgagtg cgacggccgc 720 

<210> 2 
<211> 5 
<212> PRT 
<213> Homo sapiens 
<220> 
<221> Site 
<222> (3) 
<223> Xaa equals any of the twenty naturally occurring L-amino acids 
<400> 2 
Trp Ser Xaa Trp Ser 
  1               5 

<210> 3 
<211> 86 
<212> DNA 

<213> Homo sapiens 
<400> 3 

gcgcctcgag atttccccga aatctagatt tccccgaaat gatttccccg aaatgatttc 60 
cccgaaatat ctgccatctc aattag 



<210> 4 

<211> 27 

<212> DNA 

<213> Homo sapiens 

<400> 4 

gcggcaagct ttttgcaaag cctaggc 



<210> 5 

<211> 271 

<212> DNA 

<213> Homo sapiens 

<400> 5 

ctcgagattt ccccgaaatc tagatttccc cgaaatgatt tccccgaaat gatttccccg 60 
aaatatctgc catctcaatt agtcagcaac catagtcccg cccctaactc cgcccatccc 120 
gcccctaact ccgcccagtt ccgcccattc tccgccccat ggctgactaa ttttttttat 180 
ttatgcagag gccgaggccg cctcggcctc tgagctattc cagaagtagt gaggaggctt 240 
ttttggaggc ctaggctttt gcaaaaagct t 271 



<210> 6 

<211> 32 

<212> DNA 

<213> Homo sapiens 

<400> 6 

gcgctcgagg gatgacagcg atagaacccc gg 



<210> 7 

<211> 31 

<212> DNA 

<213> Homo sapiens 



<400> 7 

gcgaagcttc gcgactcccc ggatccgcct c 31 

<210> 8 
<211> 12 
<212> DNA 
<213> Homo sapiens 
<400> 8 
ggggactttc cc 



<210> 9 

<211> 73 

<212> DNA 

<213> Homo sapiens 



<400> 9 

gcggcctcga ggggactttc ccggggactt tccggggact ttccgggact ttccatcctg 60 
ccatctcaat tag 73 

<210> 10 
<211> 256 
<212> DNA 
<213> Homo sapiens 
<400> 10 
ctcgagggga ctttcccggg gactttccgg ggactttccg ggactttcca tctgccatct 
caattagtca gcaaccatag tcccgcccct aactccgccc atcccgcccc taactccgcc 
cagttccgcc cattctccgc cccatggctg actaattttt tttatttatg cagaggccga 
ggccgcctcg gcctctgagc tattccagaa gtagtgagga ggcttttttg gaggcctagg 
cttttgcaaa aagctt 

<210> 11 
<211> 790 
<212> DNA 
<213> Homo sapiens 
<220> 
<221> SITE 
<222> (37) 
<223> n equals a,t,g, or c 
<220> 
<221> SITE 
<222> (55) 
<223> n equals a,t,g, or c 
<220> 
<221> SITE 
<222> (76) 
<223> n equals a,t,g, or c 
<220> 
<223> n equals a,t,g, or c 
<220> 
<221> SITE 
<222> (120) 
<223> n equals a,t,g, or c 
<220> 
<221> SITE 
<222> (137) 
<223> n equals a,t,g, or c 
<400> 11 
tcaactgggt gaaaaggaaa acccaccctt ggcgccnaac acgcaaaccg ccttntcccc 60 
ggcgcgttgg ccgatncatt aatgcagctg gcacgacagt tttcccgact gnaaagcggn 120 
cagtgagcgc aacgcantta aatgtgagtt agctcactca ttagcacccc aggctttaca 180 
ctttatgctt ccggctcgta tgttgtgtgg aattgtgagc ggataacaat ttcacacagg 240 
aaacagctat gaccatgatt acgccaagct ctaatacgac tcactatagg gaaagctggt 300 
acgcctgcag gtaccggtcc ggaattcccg ggtcgaccca cgcgtccggt tgaatgcact 360 
gagtcccttg gtgtagtagc aataaggaaa aatgaaatta ctttcctgtg cacacagtcc 420 
agcctaattg gtatgtgatg ttgcacttag cagccatgtg gtgggcatgt gtgactactc 480 
tggttttcac tttagtttct aaacttttta tccctctcaa gtccagcatg gatggggaaa 540 
tgtctctgga tccccacagc tgtgtacttg tttgcatttg tttccctttg agatttgtgt 600 
ttgtgtcctg ctttgagctg taccttgtcc agtccattgt gaaattatcc cagcagctgt 660 
aatgtacagt tccttctgaa gcaagcaaca tcagcagcag cagcagcagc agcacaattc 720 
tgtgttttat aaagacaaca gtggcttcta wwaaaaaaaa aaaaaaaaaa aaaaaaaaaa 780 
aaaaaaaaaa 790 



<210> 12 
<211> 554 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (552) 

<223> n equals a,t,g, or c 
<400> 12 

ttcggcacga ggtctttacc tccaaactaa cttctttcct gaacagtaga atagtttttc 60
atactatcat catttggatg gagctcttta aactgacctc agagatcaga ttcataacct 120
tttgtccaga gcaatggatg cctttgctgg ttccccgttc tcattgatgg tccctaaatg 180
tgtacttata ctgttctgtc tagtctacag cttacagtgc attcagcctt attcaagctt 240
attgaattca gcctcgttgc cttatcacca cgggcttaaa ctagctaatc ttttattaat 300
tgtattctat cctcacatac attctatccc tttttcctca agtcatcctt ctaaactgca 360
catctgatca catttgaatc ttagctcctt tacttgcttt ctggccttgg gcagttgttt 420
ataatgctct gtgtcctcca ttcctcctgc ctcctactgt ggttcatggc ttaatatatg 480
taaactatgg cattacctta ctgcttaaaa ctctuaaatt taaaaaaaaa aaaaaaaaac 540
tcgagggggg gncc 554



<210> 13 
<211> 1106
<212> DNA 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (1017) 

<223> n equals a,t,g, or c 



<400> 13 

gagcaagctc attttttttt cctatgaggc ttttgtaagt cctgacctgt atttactgtt 60
aacttcttag cttgggttca tgcaccccca gtcagtataa ctgtggacct catacccact 120
ttggcacagg cttggagtat ggatttatta caggtctgtt tctttttgtt tttctcccat 180
ttatggtcct ggacagaagg taagcttcct tgcaacttcc ctggtccggt gggtagagtt 240
ttcttgtccc ctttccagat gttaggtttt aaacaatgac tgttctttct ccatcatgta 300
gaccaaaggc caagttctgt gtccccatgg gagattaaaa cccaagcccc tatgtctagg 360
tccagtgccc actgatttct ctaattgtga gtctttctgc ttacctagta cctagagttt 420
ctcttcccaa gttttaaaaa tatcagttct aagtaggcct agcgtttcta catattttta 480
gggagagggg accctttctg tggcagctca gtgttcagca ttcctgtaag ttagcatgct 540
ctgtgtatag cagatatcac tagtaatagc atttrgtaag tgatgttcac acatgctgct 600
gtcatgaaca ctatctcatg ttgtgtaaca ctttcatttt tccaagaact ttataatcag 660
ccgacttgaa actcacagtc gtcccctcag aaaggcaggg caaatgttgt tatttccaat 720
ttgtcagaag ctcagaaagc ttattctgtt gctgacagtc cttgcaaggg tcagaatcag 780
gaccggagcc ccagatgcgc tggtgtcact gatgtcccgt gccgggcatg agcccttctg 840
tgcaaggagc tccagtgtct cccggacagt gatgatgtga aaacatttag aaccgaccta 900
cacaataagg cagattttca ttctgtaccc aaaacaggaa cacagattta atgcagagca 960
aaagggcttt aatcaacaga tatgttcatt tttcacgtag acctatttta caagctnact 1020
tgtaagccag aaaatgacat tcgagatttt caagtgagaa caaatgattt ggtccaataa 1080
ttaaaaaaaa aaaaaaaaaa ctcgag 1106



<210> 14 
<211> 568 
<212> DNA 

<213> Homo sapiens 



<400> 14 

gtggatccaa agaattcgca cgagtgccga tcagctcgga ccgaaaaaag tggtttwatt 60 

cgggctggct tgttgcggtg tgagcggtct gttttatgcc atggcttttt ggttcactgg 120 

tctgccgttg ctgagtttaa ttctgctgtg cattggcagg gtgtttctcg gcgtcggcga 180 

aagctttgcc agtacggggt ctaccctatg ggggattggc ctggtggggc cgttgcatac 240 

cgcccgggtt atctcatgga atggggtggc gacttacggt gcgatggctg ccggggcacc 300 

gctcggtgtt tacctcaatc agcactgggg gttggctggg gtggcggcgt tgatcgtgtt 360 

ggcggtggcg gtttcgctgt ggctggcgag tgcgaaccca acgtgacgat cgccgccggt 420 

aagcgtattg cctttagcgc atgttggggc gtatttggac ttacggtctg ggacttgcaa 480 

tgggtaccgt gggttttggc ggcacgagag tacttctaga gcggccgcgg gcccatcgat 540 

tttccacccg ggtggggtac caggtaat 568



<210> 15 
<211> 3692 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> SITE
<222> (518) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (606) 

<223> n equals a,t,g, or c 

<220> 

<221> SITE 
<222> (639) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (2303) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (2441) 

<223> n equals a,t,g, or c 
<400> 15 

aattcggcac aggttgtgtt tctmatgttc caggtccggc caggctggca gctcctgctg 60
gtcatgtttt cctcatgtgc tgtttccaac cagctcttgg tctggtaccc agcaactgcc 120
ttagcagaca acaaacctgt agcacctgac cgacgaatca gtgggcatgt gggcatcatc 180
ttcagcatgt catacctgga aagcaaggga ttgctggcta cagyttcaga agaccgaagc 240
gttcgtatct ggaaggtggg cgacctgcga gtgcctgggg gtcgggtgca gaatattggg 300
cactgctttg ggcacagcgc ccgtgtgtgg caggtcaagc ttctagagaa ttaccttatc 360
agtgcaggag aggattgtgt ctgcttggtg tggagccatg aaggtgagat tctccaggcc 420
tttcggggac accagggayg tggkayccgg gccatagctg cccatgagag gcaggcctgg 480
gtgatcactg ggggtgatga ctccaggcat cggctgtngc acttggtagg gcgtgggtac 540
cggggattgg gggtctcggc tctctgcttc aagtcccgta gtaggccagg tacactcaag 600
gctgknactc tggctggctc ttggcgactg ctggcagtna ctgatacagg ggccctgtat 660
ctctatgacg tcgaggtcaa gtgctgggag cagctgctag aggataaaca tttccagtcc 720
tactgcctgc tggaggcagc tcctggtccc gagggcttcg gattgtgtgc tatggccaat 780
ggggaaggtc gtgtcaaggt tgtccccatc aacactccaa ctgctgctgt ggaccagacc 840
ctgtttcctg ggaaggtgca cagcttgagc tgggccctgc gtggttatga ggagctcctg 900
ttgctggcat cgggccctgg cggggtagta gcttgcctag agatctcagc cgcaccctct 960
ggcaaggcca tctttgtcaa ggaacgttgt cggtacctgc tgcccccaag caagcagaga 1020
tggcacacat gcagtgcctt cctaccccca ggtracttcc tggtgtgtgg tgaccgccgg 1080
ggctctgtgc tgctattccc ctccaracca ggtctgctca aggaccctgg ggtgggaggc 1140
aaggctcggg ctggtgctgg ggcactgtag tgggtagtgg tagtagtggg ggtgggaatg 1200
ctttcactgg gttgggccca gtgtctaccc tgccctctct gcacgggaag cagggtgtga 1260
cctcagtcac atgccatggt ggctatgtgt ataccacagg gcgtratgga gcctactacc 1320
agctgtttgt acgagacggc cagctccagc cagtcctaag gcagaagtcc tgtcgaggca 1380
tgaactggct agctgggctc cgtatagtgc ccgatgggag catggttatc ctgggtttcc 1440
atgccaatga gtttgtggtg tggaaccctc ggtcacacga gaagctgcac atcgtcaact 1500
gtggtggagg gcaccgttcg tgggcattct ctgatactga ggcggccatg gcctttgctt 1560
acctcaagga tggggatgtc atgctgtaca gggctctggg tggctgcacc cggccacacg 1620
tgattctccg ggagggtctg catggccgtg agatcacttg tgtaaagcgt gtgggcacca 1680
ttaccctggg gcctgaatat ggagtgccca gcttcatgca gcctgatgac ctggagcctg 1740
gcagtgaggg gcccgacttg actgacattg tgatcacatg tagtgaggac actactgtct 1800
gtgtcctagc actccctaca accacaggct cagcccacgc actcacagct gtttgtaacc 1860
atatctcctc ggtacgtgct gtggctgtgt ggggcattgg caccccaggt ggccctcagg 1920
atcctcagcc aggcctgact gcccatgtgg tgtctgcggg ggggcgggct gagatgcact 1980
gcttcagcat catggttact ccggacccca gcaccccaag ccgcctcgcc tgccatgtca 2040
tgcaccttts gtcccaccgg ctagatgagt attgggaccg gcaacgcaat cggcatcgga 2100
tggttaaggt agacccagag accaggtaat atatgctcct gggcagggtg tggtatgggt 2160
catgcagatg ctcccaggct tgcaggctcc acctgacagc tgcatgttgt ctctgcaggt 2220
acatgtccct tgctgtgtgt gaacttgacc agcccggcct tggccccctt gtggctgcag 2280

cctgtagtga tggggccgta agntctttct tttgcaggat tctgggcgga ttctgcagct 2340 

ccttgctgaa accttccacc ataagcratg tgtcctcaag gtccactcct ttacacacga 2400 

ggcacccaac cagaggcgga ggctcctcct gtgcagcgca ntactgatgg cagcctggct 2460 

ttctgggatc tcaccaccat gctagaccat gactccactg tcctggagcc tccagtggat 2520 

cctgggcttc cctaccggct tggcaccccc tccctgactc tccaggccca cagctgtggt 2580 

atcaacagcc tgcacacctt gcccacccgt gagggccacc atctcgtggc cagtggcagt 2640 

gaagatggat ccctccatgt cttcgtgctt gctgtggaga tgctacagct agaagaggct 2700 

gtgggagagg ctgggctggt accccagctg cgtgtgctag aggaatactc tgtcccctgt 2760 

gcacatgctg cccatgtgac aggcctcaag atcctaagcc caagcatcat ggtctcagcc 2820 

tccattgatc aacggctgac cttctggcgt ctggggcatg gtgaacccac cttcatgaat 2880 

agcactgtgt tccatgtgcc tgatgtggct gacatggact gctggcctgt gagccctgag 2940 

tttggccacc gttgtgccct tgggggtcag gggcttgagg tttacaactg gtatgactga 3000 

ggtatcctgc ggtggctggc gtgctgggca tggggcctgc tcacagacag catggagcag 3060 

ggaagggctg tctgtgccca tgctcagcat gccttgaggg gaggaggtgg tggccgtggg 3120

ttcctgatgt cggtgcagga gctgaaggtg agtggagtgc tgccaagaat atgcccgact 3180 

ccccatgaca agacagaact ttgtaacaaa cagtaccaat ttattttggc cgtgggtttt 3240 

tgcttttttt ccagttgatg actttgtgaa cattcccagg tattggagcc tctgtggcct 3300 

taaatgtggc tcagtggagg gagacccagc atagccaggc cagtatggag cacctcacgc 3360

acagctctca gaagctgcag gcggacgaac atctgaccaa agaggtgtgg tcgaggctcc 3420 

tgaaagagaa agggcctgct ggtctcatcc tctgcttcct ttgcctttac cctatacctc 3480 

tctgcacgtc ccaccccgtt ttgctgtgtg ctcaccccca ggatgtgtac ccggttgtag 3540 

taggagctga aatccatgct gagctgtacc aggaacttgc atatctagag acagagactg 3600 

agtcactggc ccatctcttt gctcttgtgc cccaggccag aataaagaat agagtgtara 3660 

gtraaaaaaa aaaaaaaaaa aaaaaactcg ag 3692 

<210> 16 
<211> 1428 
<212> DNA 

<213> Homo sapiens 
<400> 16 

agcagggttt gagcctcctg gagacattga atttgaggat tacactcagc caatgaagcg 60 

cactgtgtca gataacagcc tttcaaattc cagaggagaa ggcaaaccag acctcaaatt 120 

tggtggcaaa tccaaaggaa agttatggcc gttcatcaaa aaaaataagg tactgatggt 180 

tggcgtgaaa tgagttttct aaggtgtgga gattttgact tgatctttta gtcttagaaa 240 

aactaagatc ctaaacctgt agtttcagaa tgcaaaagaa gaagctagtg tgctacctta 300 

tgttgagaca gtatttcttt ttggtggtgg tatctttgcc atggccctgt gtcttatttc 360

agatgcatta tcctcgtacc gtgactccca cactaacaga gtactgacct ctccaccgtt 420 

tcgcctcatg cctttccctc cttcctctcc tagactgctg gttaccttgg ctgggagaga 480 

ggatgtagtg ggacattcct gtaacacttt atccgcacat ctactggaaa tcgttaccat 540 

gttaataact tggttttgaa ttcatgttaa catgtgtacc catgaacatt tttcattttc 600 

ttttcatagt gcgatacata ggtgcatgac agcattaacc tggggacgta gaatatgatc 660 

aaggcagcat tactgcttta actttagaat gacttactat ttattaattt aaacagactg 720 

ctgtttccac aaccttagca ttgaaggtct ttcattttct cccatcaagc tatgttagtt 780 

taggtaatgt agaaatattt accctctggc ttaagctggt ttagagtaac taactagagc 840 

tatagtttgc atgggaaagt ctgcacgagc ttcttgtcag atatttcttg ctcttctgtc 900 
gcattactta ctaaacctcc caactctcat catattcttc atttaaccac ctcctacatg 960
ttttcttttg gaccatggcc taaaatttaa ttgtttgtgt tttacttgcg ttggatttca 1020
aatattattt gatgcttatt tttgttttgt gtcttcttgt ttctgatttt tactctgtca 1080
cggctccatc tcttacatgt agcttatgtc ccttttaaca tccccccatc agcctccccc 1140
tccccctcct gcctctgcct caccctctgc tgttcccaac ggcccccagt ctcccaagca 1200
gcaaaaggaa cccctctccc accgcttcaa cgagttcatg acctccaaac ccaaaatcca 1260
ctgcttcagg agcctaaagc gtggggtaag ttctgctccg gaatcctgtc tctctggcgt 1320
gctttggttg catgtttggt tctgcataac taattttgtt tgtgaatgaa tccattgtgt 1380
tttcccataa catataaaaa agttaaaaaa aaaaaaaaaa aactcgag 1428



<210> 17 
<211> 1489 
<212> DNA 

<213> Homo sapiens 



<220> 

<221> SITE 
<222> (7) 

<223> n equals a,t,g, or c 



<220> 

<221> SITE 
<222> (345) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (549) 

<223> n equals a,t,g, or c 



<220> 

<221> SITE 
<222> (1408) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (1477) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (1488) 

<223> n equals a,t,g, or c 



<400> 17 

ggagganagg atgatgatga aggaccgtac acaccattcg acaccccctc gggtaaactg 60
gaaacagtga aatgggcgtt cacctggccg ctgagtttcg tcttatactt cactgtaccc 120
aactgcaaca agccgcgctg ggagaaatgg ttcatggtga cgtttgcttc ctccacgctg 180
tggatcgcag ccttctccta catgatggtg tggatggtca caatcattgg ttacaccctg 240
gggattcctg acgtcatcat ggggatcacc ttcctggctg ctgggaccag cgtgcctgac 300
tgcatggcca gcctcattgt ggccagacaa rggatggggg acatngctgt gtcaaactcc 360
attgggagca acgtgtttga catcctgatt ggcctcggtc tcccctgggc tctgcagacc 420
ctggctgtgg attacggatc ctacatccgg ctgaatagca gggggctgat ctactccgta 480
ggcttgctcc tggcctctgt ttttgtcacg gtgttcggcg tccacctgaa caagtggcag 540
ctggacaana agctgggctg tgggtgcctc ctcctgtatg gtgtgttcct gtgcttctcc 600
atcatgactg agttcaacgt gttcaccttt gtgaacctgc ccatgtgcgg ggaccactga 660
accgccgggt gcccacagar gctcagctcc ttcttttctg tgcaatacga racccggccg 720
cacccgartc acacaggccc ctggggccac ggcgttcgtc tctcctgtgc tgtcctcagg 780
cctccgctcc tgttttggtg gcccargctc tcccctgccc catcctcgct cccccacctc 840
cttgggtcat gcccacccac cctttcctgc ctcctccgtg tkaagacatc caacatccac 900
gtgacttttc cagctccatt tttgaacagt gactgagatt ctagaaaaac ccggctgcta 960
actggcctga gccaggcaac actgattcca atccctyytc cttttttaag ttatttgatg 1020
gaagactcac ctaatttgtg acctgagact gttgaagaaa tagagaggag ggggcccgtt 1080
gattacagag agcatttggg attttgtttg gtttggagat gatgcctagg ttactgggtt 1140
tggggggatt gttttctttt gggggccttc cccttttact ccttttcttc cagagatcaa 1200
gagcttctct tgcatcttct tccactgggc cccggattaa tcaattaccc aaaggctgca 1260
cctgccgtgt tgtctgggct tgcatcccag atgtgttgga gtatgcatgg atgtagtgct 1320
ttttagagga gccactgggc aaggccacca agaacaaatg catgacattt tatagccaag 1380
gacgcctcac taaagtctta tgggcgtncc ctggggttgg gggggcacaa ggttttggag 1440
gaagaagaca acttcctcat tccatcatca ccatctnttt ctcactang 1489



<210> 18 

<211> 1940 

<212> DNA 

<213> Homo sapiens 



<400> 18 

acgcgtccgc ttcccagaaa atagatgaca tcagtgcccc ttgctacttt ctcagtcctc 60
actattgctt tgagggccca ggtactgaaa ctggttgtct tgagttttgt gtcagctttt 120
tctccagtcc attatccccc tcccttgctt ctgaagcagt ctaggttaaa ctagccaggc 180
aggtagttgt ggaactggtg attttcaaaa gccccacttt agagatcagg ccacagcttt 240
ttatatcgca caggacacat cagcctgagc tgctgcctca tgcctgtttc cccaggaacc 300
tcactccttt ggtagaacct tgggatttta gaaattgtgg ctttccataa ctcatttact 360
ccaacagttg aagttacaca cattgctccc aaatttggaa atagaccaca gtaccttacc 420
tttcattccc catctggcct ttaccttctt tgcttcagtg gttgaaaaca gttgccatat 480
tcaaagtata gtagatttca acctcacaca aatgacaagt cccattttac aatcctagga 540
aggcccacca atttcatttc acgcgccagg gcggctgcag ttggaggccg agggcagccc 600
tctgctcact gaatgtcttg catgtgctga ctgctgcccg cagtgctgaa catgccccac 660
cgcccaggcc cagcactgct tgttgggtca gcatctagtg ctgctgtcac atctttgtct 720
gcacagccag taggattgcc tcagccaggg ggtttatcag aaggtgtgca aggcctttgg 780
gggaactgag cccctatagt gggcagtctc ctttaccttc ccacctccct gaaaagcaca 840
gaagacagtg ccttggtttg tgttttgaag caaacaagtc agctttctgg ctttgcccca 900
aaactgtgat ggaacataat aaaactggag atatggtttt taacactgca aaaaggaaaa 960
agcatcaagt ttctacttct ggctggaaag caaaaccaat ctcagctgac aaggctgggc 1020
aaactaagtt ttcctgagcc cattttcctt tgagccctga cctagcctgg ccttacctca 1080
ttaaggtttg gttaaagcag tggaaaggag gaggaggcag gggtggatgg gggtgtgggg 1140
aggggatgag cactctgcag ccgattaatc tgttggtagg ggcccagctt cttgggagtg 1200
cttattcagc ccaagagtgg aggctgttta cagcgagccc tggagatggc agcttgtctc 1260
cagctgggga ggggtcaggc ccctaaattg aagaccactt tggtagcaga actgtaggga 1320
ctggtgagtc aactcacaga ttctgcagca gctgctccac ccacaataaa gcaaacgccg 1380
acaggctaga ccccagattg caggggctgc cactacaagg tgggaccaca ggctgcctca 1440
ccgggattgt ttgccactaa atagctggag tcacagattg agataaatgc caccttcaag 1500
gttgcagtga aaagcataat cctatgtgat gaatttatat gtgttatttt ttaaaaagct 1560
attttattac tgcatgttcc cgtcccgtct tgtgaatgtg agtccccgcc accacgtgag 1620
gtgcagtcgt tgcagcggct ggtgcaggag tgccactggc gcgtgtgtga tagcatctcg 1680
taggtgttgc tgcacaagag ttaaccagag tcaatgccaa acacatagta tgagaagtgt 1740
actttttaag aaattaattt atttgagttc aaatattttt gaaatataaa aattggttgt 1800
attttttaaa gctataattc ttgtagacat tctgtggtta aaaatttgat tgtgcttatt 1860
aaaaatggtc atctatgttt tgcacttcag ctacgtgaaa ataaaatttc tttgggaagg 1920
tgaaaaaaaa aaaaaaaaaa 1940



<210> 19 
<211> 1592 
<212> DNA 

<213> Homo sapiens 



<400> 19 

ccacgcgtcc gagcaattta taaattgata ccagtaatac ggtgccttga caaactagat 60
tgtttgagcc atgttatgtg acctcatctt gttatttaat ataaaaatgg caatttatca 120
tctgataata ctgcagtttt tctgtagtgt ttgctctgag cccgacactg cactgagtat 180
ttccccactg taggtcatat tattgtcccc attttgccga tgaagacctg agaggtgggt 240
aggggcagga aagtggtcag tgaggtgctg gtggggtgga ggggccaagg atcagctgag 300
gctttctgac ctgagagctg gcagtgccca cccaacagct ctacctgtta catttctgtt 360
atcctcatgc catcctctga ataaacatcc atcacgctgg tccttggtac ccacgacaat 420
gacagcttgt cagccctggg ccagtgctgt ggcacgtgga catggggaag cccagaggtg 480
gcaaggatgg agtgggttcc atgtggggag gataagggtg catggagctg gcaggggagt 540
taagaccggg aaggccagtg ggagccagcc cgggggcagg ggacctggaa tgccagcatc 600
agaagacttc cccgcatgag gctcttggta actggggctc acctcccttc tgtctgctgc 660
aggatggggt gcatgaagtc caagttcctc caggtcggag gcaatacatt ctcaaaaact 720
gaaaccagcg ccagcccaca ctgtcctgtg tacgtgccgg atcccacatc caccatcaag 780
ccggtgagta ggggaggtcc cagtttccct gggggctgac ggatgctgcc ccaacattgc 840
cctaacagcc tcctgtcttc ccaaggttgg gcagggtgag cttggggtga aagggagctg 900
actggccacc aacagtgtcc tgactcgact ctccgggacg cagcgccagc cagcatgcat 960
aggggcaaag aagtcatctc tctttcctgc agcccaggct tggcccctca gcctgagctc 1020
caccaaacag gtgtgagtga gcatctcagg cctccggcct gggcagaagc aaaagcagtg 1080
ctgagaaacg agtgaacagt aatgatagca gctgaagtga caatgtaatg gtagtatcag 1140
cagcattgca aaaatacagt aacaattgtc atactaatag taccaaccat aattgcgtga 1200
gtataagacc aagagtagca gcaggaacaa cagcaagagt tgtaaaaaga gaagcagtca 1260
tgctaggaga ggtatttacc aacttctttt atgtttatac aaagctgact atgaacccag 1320
cactgttgta agcctttacg gttattaaca ccctaatcct cacaataatc ctattgtgtg 1380
acacacaaat acattcttct tgaaaataaa aacacaggcc aggtgcagtg gctcacactt 1440
gtactcccag cattttggga ggctgaggca ggaggattac ttgagcccag gagttcgaac 1500
tgcatgagct atgatcacgc cactgcactc cagcctgggc aatacaggga gaccctgtct 1560
ctttaaaaaa aaaaaaaaaa aaaaaaaaaa aa 1592



<210> 20 

<211> 1410 

<212> DNA 

<213> Homo sapiens 



<400> 20 

gcccacgcgt ccgagaaaaa tgctgctcag tttttattgt ctaccaatgg taagtataca 60
tattttcttt ccatgtgccc actgtgtgta cctgttgcac atatcctgta gcctaggaga 120
ggaatcattt aacagagata cttgtaaaaa ggacttttgt ttttctatac agaatgtaaa 180
ctctactttt ttactgtcac ttgcagtttt tagattctct gaaagattct ctgatagcaa 240
ttttttgttt actacacctc caatttgtag tgaaaagaat gggctgctat accattggat 300
ttaggtcagg tactatttct gtcatttctc agtctcgtaa tcttgggcag gttactaaca 360
ctgaattgaa ttttcctcag cagcaaacta gagatagcaa ttttttatta tagtattatt 420
atgaatatta aataacttca catacatcat gagtgcaagt gctcaataaa tgttaattta 480
ttcctccttt ttaagtgttt gtaaactaca cagagtatct caaactgcag atacaaaata 540
ctcaaaggat ggtctccatt ccaggatacg ctataggaga gcactttctt acttgatcac 600
cattagcata ttgccttctt cccagcaatc cacatggctg gaaggagatt cctctcctac 660
tgtttacttg ccaagggaac attttttgtt gttttttgag acaatgtctg tcgcccaggc 720
tgaagtgcat tggtgtaatc acagctcact gcagcctcga cctccctacc tcagtctcct 780
gagtagctgg gaccacaggt gagtgccacc acacccggct aattttttaa aaacattttt 840
gtagagcctg ggtaacatgg ggtggaacaa gcctgtagtc ccagatactc aggaggctga 900
ggtgaaagga ttgcttgggc cagggaggtc aaggctgcag tgagccgtga aaggccacto 960
cactccagcc tgggtgacag aatgagacct tgtctcaaaa aaaaaaaaaa agtttcttgg 1020
aacctatacg tttttttttg tttttttttt gaaaagccag accttgtgcc cttgttttga 1080
acaccgactg ggaagatggg gcttaggtaa cagccaaacc tggctgtcag ctgtgtggga 1140
gccaccaccc tctctgggaa gagttcctgc ttctgtatgg caagcataaa tcaagctcag 1200
tctgggttat ggagaagttg aaaattgttt tgttcctcat tagtttataa ttgtatgaaa 1260
tacgatttta atgaaaactt ttcagaattc acgtttgtgt agatatttca gagaaccatt 1320
tttactttac atcctaaaac tgccttttcc tatggttttg tcaataaaac actatgatgt 1380
tgaaaaaaaa aaaaaaaaaa aaaaaaaaaa 1410



<210> 21 

<211> 1727 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (979) 

<223> n equals a,t,g, or c 

<220> 

<221> SITE 
<222> (1047) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (1135) 

<223> n equals a,t,g, or c 
<400> 21 



ccacgcgtcc ggccatggtt gccactgtct gtggcctcct ggtcttcctg agcctgggcc 60
tggtaccccc agtccgctgc ctgtttgcac tcagcgtgcc caccctgggt atggagcagg 120
gccgccggct gctcctgtcc tacagcactg ccaccctggc cattgctgtg gtgcccaacg 180
tcctggccaa cgtgggtgcg gccgggcagg tgctgaggtg tgtcaccgag ggctccctgg 240
agagtctcct caataccact caccagctgc atgcagcatc cagggctctg ggccccacag 300
gccaggcagg cagccggggc ctgacatttg aggcccagga caatggctct gccttctacc 360
ttcacatgct cacggtcact cagcaggtcc tggaggattt ctctggcctg gagtccctgg 420
^^^f^^'^^r ^^^^^^.^Srgg acccagcgag tggtcacagg gctgtttatg ttgggcctcc 480
Inl^f^^^f ^^^^^^^^^^ ctccattgct acctgacaga cctgcggttt gacaatatct 540
acgccactca acagctgacc cagcggttgg cacaggccca ggctacacac ctcctggccc 600
at^o^o^^^ ctggctgctc caggcggctc agctgaggct gtcacaggag gagctgttga 660
gttgtcttct aaggctgggg ctgcttgccc tgctcctcgt ggccacggct gtggcggtgg 720
^^^o^^r" ^^^^^^^"^ ctcctggcac aggctactgt ggactgggct cagaagttgc 780
caactgtgcc catcacgctc acggtcaagt atgatgtggc atacactgtc ctgggcttca 840
tccctttcct cttcaaccag ctggctccgg agagcccctt cctctccgtc cacagctcct 900
accaatggga gctccgcctc acctccgccc gctgcccact gctacccgcc cggcgtcccc 960
gcgcagctgc cccgctggnc gcggggggcc tgcagctcct ggcgggctcc acggtgctcc 1020
tggagggcta cgcccgccgc ctgcggnatg ccatcgccgc ttccttcttc acagcccagg 1080

aggcgaggag gatccgccac ctacacgccc ggctccagcg aagacacgac aggcnccaag 1140

gccagcagct gcccctaggg gatccttctt gcgtccccac acccagacct gcctgcaagc 1200 

ctccggcatg gatagcctac aggctggatg ccttaagaac cgagagcagt gagggagaag 1260 

ggaaagagct ttggagttgc agagacctga gttgtcacct tggtcctgtg ccgcctccct 1320

gtgtgacctt gggtaagtca cttcacctct ctgagcctcg gtttctacat ctgcataacg 1380 

acagcatatt taccattgat gtgacctact tcccacgcag ggatgtggtc aggatggaag 1440 

gaaatactgg gcatgatagg cctggataac cggtaaagaa ccatgcaaag gcgaagacaa 1500 

ggagtgcaga gagagctcat ggttcctcca ggctggttgg cgatcaggct catctcatct 1560 

gcaccaactg ctctacttgt tagatggaga ccttgcatca tgaatttctc gaaatgctcc 1620 

tggaacttat ttatatgcct caaaatcctc taaactcatt tatagtaacc catagtttta 1680 

attttataaa taaacgtatt tattaaatct taaaaaaaaa aaaaaaa 1727 



<210> 22 
<211> 1218 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (389) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (740) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (1048) 

<223> n equals a,t,g, or c 



<400> 22 

gaaaatagaa taaatgccca tccataagac taaaatttct tgtgtttttc tccttctgag 60 
tttaaaatgg cactggatga caaatggaaa gcttgatgct gctcttaatg tgccgctagg 120

attccgggga tttcaaagcc agtggacggg aggtggcctc tgccagtgtc tgtctggtgt 180 
ctgtctgtgt cactgtggtg ctgcctgggc cacagaccta ggcaggaccc tgggtgatgg 240 
agctcctgtc tggtgggtgt gtgtgggcag tgctgttcct gtccacgtta gaaaagccct 300 
cttactttac actgagtcat gctccctctc caccacggac cgcagtcccc ttccctagtg 360

actcgctgtc cccttccttt gttgcgcant ttctggcttt aaatgaggag agcttaagaa 420 
tggatgggga gctcagcact cacagtaact gttggtgaac tcagggcctg ctacgtctgg 480 
aacacatcaa gccatttagt gggtgaggtc attcactgtt tttaaatgct gctgcagctc 540 
ttatttctca tgaagccctt tatacctatt aaatacttca tagtattgaw taacttagct 600 
gsytgctcct ctctgtcatg gcaccttttg ctcatgtgga ctttawggtg cagaaacacg 660 
aatcgattgt cgtaatgaac aamamccctc tgaagtggcc acggcgggta tgattcgtcc 720 
cagttcacgg gcgagtaacn gaggtgcgca gtggcggggc agctggccca ggtcgtgcag 780

ctgctgtgcg tgagccagct cgctcctgag tttccttttg tttgacagca ttttgtttac 840 
agacaccaca ccaatccttg gtcttggata catcagaaaa gttggagttc tagaggtggg 900 
tggaggcagg acttgtaccc tctccctgca gcaaagacaa attcattaag catttggaac 960 

acttgttaag ttcagtttgt ctctctctaa aagttatcac tagatgactc tctcattttt 1020 

gtgtgtgcgt gttttagatt tgcctgtnac ttacgaccag ggatactggc tttctattta 1080 

tggtagtaat agcagttctc cttttaaata aacttatttt cagccaaaag agtgattagg 1140 

tctatcaaaa aatgataagg aaataaacag tacagatcgt ctatatttat ggcaaaaaaa 1200 




aaaaaaaagg gcggccgc 1218



<210> 23 

<211> 712 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (26) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (28) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (77) 

<223> n equals a,t,g, or c 

<220> 
<221> SITE 
<222> (117) 

<223> n equals a,t,g, or c 

<220> 
<221> SITE 
<222> (124) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (696) 

<223> n equals a,t,g, or c 
<400> 23 

taggcccggg acggttacaa tttacncngg aaccgctttg cccataggct ttgcaaaaag 60
ctttttaggt gccactntag aaggtacccc tgaaggtacc ggtccggaat tcccggntgg 120
accnacgcgt ccgaggaggt cytttaggaa gactctcaaa ggcaaatccc tgatcccccg 180
ccccaccctt agccctgccc tctcaccaga gcaaaattca ctggggactt ttcccaccac 240
acatggaaat ctgtccactc ggaatacctc tgttttccat ttcaaattgt agggggaggg 300
gatggaacac ttccagtgat ggtaagagat ctgttatgaa acgaaacacc ccccgtgtta 360
ataacttggt ctgaaatctg tttttatgag ccgggccccc tgtgcctcta gtatacttgt 420
attgactctc atagttaccc ttttagtttt actgtgttct gtgaaaattt gtaattggtt 480
gagaatcact gtgggcgtcc attcttattc aactaaatct ccacaggttt tttgagctgg 540
tgtggattag tttaactctt gtattcaacc attagtgcta ccaccttctc acattacaat 600
acaattactg gaagcaagta ctgcatttcc tatgcaacaa aaaaggaaaa ataaaaaatt 660
gctaatgcta aaaaaaaaaa aaaaaaaaaa aaaaanaaaa aagggcggcc gc 712



<210> 24
<211> 1422
<212> DNA

<213> Homo sapiens



<400> 24 

gtctccgctc ctgtgcccgg gaagatggtg ctaggtggtt gcccgaatca cgccattttt 60
taacatctct ttttgatcaa acaagaaaaa gcatttggga aatgcaaaga ggactgagaa 120
tactttggct taaattttgc ccccagaatc ttgttgtttg cctactgaag agatgaaacc 180
atggcagaag tagaatcctt atagaaacag gaccagaaac acctcccttc cccaacaaaa 240
ggttcatttt ggtggctgtc cgtttgacct gctgtgcttc agtttaattg gcttggaaag 300
gggtcagcag ggtgaaaccg aaccccagaa aacttgatga agaaatgtct tttgcccgtt 360
ttgattacgt gcatgcaaac agcgatttgc aaagaccgta tgatgatgat catgatctta 420
ctggtgaatt acagacctga tgaatttata gaatgtgaag acccagtgga tcatgttgga 480
aatgcaactg catcccagga acttggttat ggttgtctca agttcggcgg tcaggcctac 540
agcgacgtgg aacacacttc agtccagtgc catgccttag atggaattga gtgtgccagt 600
cctaggacct ttctacgaga aaataaacct tgtataaagt ataccggaca ctacttcata 660
accactttac tctactcctt cttcctggga tgttttggtg tggatcgatt ctgtttggga 720
cacactggca ctgcagtagg gaagctgttg acgcttggag gacttgggat ttggcggttt 780
gttgacctta ttttgctaat tactggaggg ctgatgccaa gtgatggcag caacnggtgc 840
actgtttact aaaaagagct gccatcatgg cccagggagg cgggtgaaag ctccgtcttc 900
tgaattcatc tctacaggct caaaactcct ctttgatatc agacctgatg ttattttcct 960
tcttttggag ggcatttgtt tggttaagaa ggcttctttg gactttggaa tttcaaccca 1020
gattttacct tgcagacgga atgacaagca aaaagtgttg tggggaatca aatttgttcc 1080
tttcctcatg cacaaaacat aaaggatagt ggcgagttta caagctgtgg atgggtttcc 1140
atagtcttcc tttctgtaca ttgctatatc ttcagtcctt tggagcaagt ggacctaaca 1200
agttgagcaa aatgaatatt tggatccatg ttcctcttgt gaccctgagt cttcacgcaa 1260
ggagatctga agctgaacaa tgaaaatctt cagcagaaat agaaatggcc gtggattgta 1320
atacacactg aaattctgac tttctgaatt taaatgtaga ataaatttta ccaacttgga 1380
aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaactcg ag 1422



<210> 25 
<211> 1038 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (806) 

<223> n equals a,t,g, or c 

<400> 25 

ggcacgagtg gctgcagcgg ggcccgcgtg gtgcctcctg aggcggcccc cggatgaaga 60
gatctgggaa cccgggagcc gaggtaacga acagctcggt ggcagggcct gactgctgcg 120
gaggcctcgg caatattgat tttagacagg cagacttctg cgttatgacc cggctgctgg 180
gctacgtgga ccccctggat cccagctttg tggctgccgt catcaccatc accttcaatc 240
cgctctactg gaatgtggtt gcacgatggg aacacaagac ccgcaagctg agcagggcct 300
tcggatcccc ctacctggcc tgctactctc taagcrtcac catcctgctc ctgaacttcc 360
tgcgctcgca ctgcttcacg caggccatgc tgagccagcc caggatggag agcctggaca 420
cccccgcggc ctacagcctg ggcctcgcgc tcctgggact gggcgtcgtg ctcgtgctct 480
ccagcttctt tgcactgggg ttcgctggaa ctttcctagg tgattacttc gggatcctca 540
aggaggcgag agtgaccgtg ttccccttca acatcctgga caaccccatg tactggggaa 600
gcacagccaa ctacctgggc tgggccatca tgcacgccag ccccacgggc ctgctcctga 660
cggtgctggt ggccctcacc tacatarcgg ctctcctata cgaagagccc ttcaccgctg 720
agatctaccg gcagaaagcc tccgggtccc acaagaggag ctgattgagc tgcaacagct 780
ttgctgaagg cctggccagc ctcctngctg ccccaagtgg caggccctgc gcagggcgag 840
aatggtgcct gctgctcagg gctgcccccg gcgtgggctg ccccagtgcc ttggaacctg 900
ctgccttggg gaccctggac gtgccgacat atggccattg agctccaacc cacacattcc 960
cattcaccaa taaaggcacc ctgaccccaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 1020
aatttggggg ggggcccc 1038



<210> 26 

<211> 1906 

<212> DNA 

<213> Homo sapiens 



<400> 26 

ccgcaacgca gtgagctcgc ggccgcgagc aacaggccgt gccgrgtttg catttcctta 60
ctgctttgtc ttgaagacag aacgatgcca aagaaagcaa agcctacagg gagtgggaag 120
gaagaggggc cggctccctg taagcagatg aagttagaag cagctggggg gccttcagct 180
ttaaactttg acagtcccag tagtctcttt gaaagtttaa tctcgcccat caagacagag 240
acttttttca aggaattctg ggagcagaag ccccttctca ttcagagaga tgaccctgca 300
ctggccacat actatgggtc cctgttcaag ctaacagatc tgaagagtct gtgcagccgg 360
gggatgtact atggaagaga tgtgaatgtc tgccggtgtg tcaatgggaa gaagaaggtt 420
ttaaataaag atggcaaagc acactttctt cagctgagaa aagattttga tcagaaaagg 480
gcaacgattc agtttcacca acctcagaga tttaaggatg agctttggag gatccaggag 540
aagctggaat gttactttgg ctccttggtt ggctcgaatg tgtacataac tcccgcagat 600
ctcagggcct gccgccccat tatgatgatg tcgaggtttt catcctgcag ctggagggag 660
agaaacactg gcgcctctac caccccactg tgcccctggc acgagagtac agcgtggagg 720
ccgaggaaag gatcggcagg ccggtgcatg agtttatgct gaagccgggt gatttgttgt 780
actttcccag aggaaccatt catcaagcgg acactcctgc ggggctggcc cactcgactc 840
acgtgaccat cagcacctac cagaacaatt catggggaga tttccttttg gataccatct 900
cggg^cttgt atttgatact gcaaaggaag acgtggagtt acggaccggc ataccccggc 960
agctgctcct gcwggtggaa tccacaactg ttgctacaag acgattaagt ggcttcctga 1020
ggacacttgc agaccggctg gagggcacca aagaactgct ttcctcagac atgaagaagg 1080
attttattat gcacagactc cccccttact ctgcgggaga tggggcagag ctgtcaacac 1140
caggtggaaa gttaccgagg ctggacagtg tagtgagact gcagtttaaa gaccacattg 1200
tcctcacagt actgccggat caagatcaat ctgatgaagc tcaagaaaag atggtgtaca 1260
tctatcattc cttaaagaat agtagagaga cacacatgat gggaaatgag gaggaaacag 1320
agtttcatgg acttcgcttc cctttgtcac atttggatgc actgaagcaa atttggaata 1380
gtccagctat ttctgtcaag gacctgaaac ttactacaga tgaggaaaag gaaagcctgg 1440
tattatccct ctggacagaa tgtttaattc aagtagtcta gtgcctttgc agaatcaaat 1500
gcctactatt ttatatgcat atattaaaag aaaagcaaag acctgagccg aggagaggat 1560
gaattcaagt ttccttacct gcgtatctac taacaaacat gagacctccc tgttacaggt 1620
ggtcagttgg ccaaatgtac taacgggcac atgaaagaaa gaacagcaaa ttaccaagtg 1680
tctcagaaaa tgacaaaacc atattttgac aagtttattt aatccagtgt ggtagaaaag 1740
gcacaattcc aatgtatcat ttagaattga atgtcattaa cctggctttg ttctttggaa 1800
gaaacaactt ctttaaagag cttctttggc tctagaaaaa tttcaaacaa ttaaaataag 1860
aaaaaatttt aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa ctcgag 1906



<210> 27 
<211> 847 
<212> DNA 

<213> Homo sapiens 
<400> 27 

tggtggcggc atacatcgcc ttcacaatgg cgctctgcag ctgcgtgttc tgcagcgtgt 
cgagcatctt catctgctcc atcacgctgt aaaacacatt tgcaccgcga gtctgcccgt 
cctccacggg ttcattgcgg cgcagtgtag acctgggarg atggscggcc tgctgctggc 



60 
120 
180 



240
300 



tgcttttctg gctttggtct cggtgcccag ggcccaggcc gtgtggttgg gaagactgga 
ccctgagcag cttcttgggc cctggtacgt gcttgcggtg gcctcccggg aaaagggctt 
tgccatggag aaggacatga agaacgtcgt gggggtggtg gtgaccctca ctccagaaaa 360 

420 
480 
540 
600 
660 



caacctgcgg acgctgtcct ctcagcacgg gctgggaggg tgtgaccaga gtgtcatgga 
cctgataaag cgaaactccg gatgggtgtt tgagaatccc tcaataggcg tgctggagct 
ctgggtgctg gccaccaact tcagagacta tgccatcatc ttcactcagc tggagttcgg 
ggacgagccc ttcaacaccg tggagctgta cagtctgacg gagacagcca gccaggaggc 
catggggctc ttcaccaagt ggagcaggag cctgggcttc ctgtcacagt agcaggccca 
gctgcagaag gacctcacct gtgctcacaa gatccttctg tgagtgctgc gtccccagta 720 
gggatggcgc ccacagggtm mwgtgacctc ggccagtgtc cacccacctc gctcagcggc 780 
tcccggggcc cagcaccagc tcagaataaa gcgattccac agcaaaaaaa aaaaaaaaaa 840 
actcgag 847



<210> 28 
<211> 985 
<212> DNA 

<213> Homo sapiens 
<400> 28 

ccacgcgtcc ggcacagatg agagcgctcc gaagactgat tcagggcagg atcctgctcc 
tgaccatctg cgctgccggc attggtggga cttttcagtt tggctataac ctctctatca 
tcaatgcccc gaccttgcac attcaggaat tcaccaatga gacatggcag gcgcgtactg 
gagagccact gcccgatcac ctagtcctgc ttatgtggtc cctcatcgtg tctctgtatc 
ccctgggagg cctctttgga gcactgcttg caggtccctt ggccatcacg ctgggaagga 
agaagtccct cctggtgaat aacatctttg tggtgtcagc agcaatcctg tttggattca 
gccgcaaagc aggctccttt gagatgatca tgctgggaag actgctcgtg ggagtcaatg 420 
caggtgtgag catgaacatc cagcccatgt acctggggga gagcgcccct aaggagctcc 480 
gaggagctgt ggccatgagc tcagccatct ttacggctct ggggatcgtg atgggacagg 540 
tggtcggact cagcactacg gcggctccgg ggctccgggg acttggcagg ggagctggag 600 
gagctggagg aggagcgcgc tgcctgccag ggctgccgtg cccggcgccc atgggagctg 660 
ttccagcatc gggccctgag gagacaggtg acaagcctcg tggttctggg cagtgccatg 720 
gagctctgcg ggaatgactc ggtgtacgcc tacgcctcct ccgtgttccg gaaggcagga 
gtgccggaag cgaagatcca gtacgcgatc atcgggactg ggagctgcga gctgctcacg 
gcggttgtta gtgtgagtct ggagggtgcc cttcctccac cagccctgtg gggagggacc 
cccaggtcct ctgcattaaa ccagtttaca ctccaaaaaa aaaaaaaaaa aaaaaaaaaa 
aaaaaaaaaa aaaaaaaaaa aaaaa 



60 
120 
180 
240 
300 
360 



780 
840 
900 
960 
985 



<210> 29 

<211> 914 

<212> DNA 

<213> Homo sapiens 

<400> 29 

ggcacgagct aaggctaaga aagaacactg tgaaattttc attatataga cattttaaaa 60 

atactctgat ctttgctgtg ctggcttcta tagtgtttat ggggtggaca actaagacat 120 

ttagaattgc aaaatgccaa tcagattgga tggaacgctg ggttgacgat gcattttgga 

gcttcctttt ttcgcttatc cttattgtaa tcatgttttt gtggagacca tcagcaaaca 

atcagagata tgccttcatg cccttaatag atgattctga tgatgaaatt gaggaattca 

tggtaacttc tgaaaattta accgaaggaa taaaattaag agcctcaaaa tcagtttcca 

atggaacagc taagcctgcc acttctgaga actttgatga agatttgaag tgggtagaag 

aaaatattcc ctcttcattc acagatgtag ctcttccagt gttagtggat tcagatgagg 

aaatcatgac cagatctgaa atggctgaaa aaatgttctc ttcagaaaag ataatgtgat 

tggaacccgt ataagaaatg tagttaagcc tgaaggacta tccttcatca agactgaaag 



180 
240 
300 
360 
420 
480 
540 
600 
tgagctttga tttgatattg cctaaaaatt tttattgtgt tatcttggaa gtctgtgtat 660

caaaatgaag aattcagatg gtaggaggtt ctatagtcct tttaaagctg actcttgagt 720 

gtcagttgaa tatccattaa attggatttg gaaataacct gaggaaagta ttatgataaa 780 

gatgcctctt agctgatagg tggcaggcct gtgggtttgt gttctccctc 840 

ttttctctgg aacatatgac aattccagat taaagaaaaa tgttttttaa taaaaaaaaa 900 
aaaaaaaaaa aaaa 

914 



<210> 30 

<211> 1183 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (4) 

<223> n equals a,t,g, or c 
<220> 

<221> SITE 
<222> (7) 

<223> n equals a,t,g, or c 



<400> 30 

cacntgnatt catctatcag aacaatggtg tgagcatgaa gaggcacaga caggtctcca 
aaatagatgt taggatttgg gtgctacctg acacagaagt aggtctaacc ctccaagtac 
tggggatgat aggataatca atgaggtata tatatatttg tcattttgta taaaatattg 
tgaaaattga aggaggacac tcagtaaaca tcctgggact atttgtaagt tatggcaaaa 
ccagatgaga gaaaagggac agtcccctct gtatcctcgt tgtctcttag taacatcaaa 
ttgtagttaa aaaaatttta aactatgtac aagctacaaa atagcatctc tttcatggta 
tgtttgagtg tgtaatttta gtttcttttc tggttgtatt tgtggtagtc agatgtgttg 
gattgattcc aactggacag agtaaggaat tccagcatcc tcttcctgct tgctcgtgtt 
accccacaga tcaaaccctc aattctagtt ggggatgctg tctagcccca caccatgact 
gaagccttaa gcactgttgc gcctccatgt gctttggatc agcaacccca gtggtattct 
accagagcat tgtgggaaag cagatgtata gtcaggtccc aayagcaaat tgttgggtgt 660
gagagttcta aagtataggg gtgagggaag agaaggatat gaactcctct gaccttaagc 720 
cagcattcat ttaactttta tgtctactta acaagagaac ctggagaaaa ctaccgtatt 780 
caagagatta atcaaaatca gtgttttagc caggcgatga cagagaagca ccattcctca 
ccctccattc ttgtaatgtc tgtaataaat ttcagtgcgt caggatggat gaacccaaga 
tccagtgaat gattcagctg ttccaagcct tacattttcc atcattcatc atccattctc 
attcagtgta acctcttgca ctattgtggt taattttatg taaaaccagt ttatgttttt 
ttttttttaa tatgtgccta tgtaataaag tctacacact ggctatctct gtagaggtga 
ggttttgttt ttagttgttc tactgattat atccttttct gagctatgaa aatgaattat 
taataaaaaa tttttgaaca aaaaaaaaaa aaaaaaactc gag 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 



840 
900 
960 
1020 
1080 
1140 
1183 



<210> 31 

<211> 2377 

<212> DNA 

<213> Homo sapiens 



<400> 31 

aggggcacga gcctaggtgt tgtcgtccct 
ggatattcag tcatgaaatc agggtaggga 
tgtttgtgtt gcgggggccg gacttcaagg 



gctagtactc cgggctgtgg gggtcggtgc 60 
cttctcccgc agcgacgcgg ctggcaagac 120 
tgattttaca acgagatgct gctctccata 180 



gggatgctca tgctgtcagc cacacaagtc tacaccatct tgactgtcca gctctttgca 240

ttcttaaacc tactgcctgt agaagcagac attttagcat ataactttga aaatgcatct 300

cagacatttg atgacctccc tgcaagattt ggttatagac ttccagctga aggtttaaag 360

ggttttttga ttaactcaaa accagagaat gcctgtgaac ccatagtgcc tccaccagta 420 

aaagacaatt catctgggca ctttcatcgt gttaattaga agacttgatt gtaattttga 480 

tataaaggtt ttaaatgcac agagagcagg atacaaggca gccatagttc acaatgttga 540 

ttctgatgac ctcattagca tgggatccaa cgacattgag gtactaaaga aaattgacat 600 

tccatctgtc tttattggtg aatcatcagc taattctctg aaagatgaat tcacatatga 660 

aaaagggggc caccttatct tagttccaga atttagtctt cctttggaat actacctaat 720 

tcccttcctt atcatagtgg gcatctgtct catcttgata gtcattttca tgatcacaaa 780 

atttgtccag gatagacata gagctagaag aaacagactt cgtaaagatc aacttaagaa 840 

acttcctgta cataaattca agaaaggaga tgagtatgat gtatgtgcca tttgtttgga 900 

tgagtatgaa gatggagaca aactcagaat ccttccctgt tcccatgctt atcaytgcaa 960

gtgtgtagac ccttggctaa ctaaaaccaa aaaaacctgt ccagtgtgca agcaaaaagt 1020 

tgttccttct caaggcgatt cagactctga cacagacagt agtcaagaag aaaatgaagt 1080 

gacagaacat acccctttac tgagaccttt agcttctgtc agtgcccagt catttggggc 1140 

tttatcggaa tcccgctcac atcagaacat gacagaatct tcagactatg aggaagacga 1200 

caatgaagat actgacagta gtgatgcaga aaatgaaatt aatgaacatg atgtcgtggt 1260

ccagttgcag cctaatggtg aacgggatta caacatagca aatactgttt gactttcaga 1320 

agatgattgg tttatttccc tttaaaatga ttaggtatat actgtaattt gattttttgc 1380

tcccttcaaa gatttctgta gaaataactt attttttagt attctacagt ttaatcaaat 1440 

tactgaaaca ggacttttga tctggtattt atctgccaag aatatacttc attcactaat 1500 

aatagactgg tgctgtaact caagcatcaa ttcagctctt cttttggaat gaaagtatag 1560 

ccaaaacata aaaaaaaaaa aaatcctcag tatagcttgc aattaagacc tagatcacag 1620 

tatttaagtg ttttgcgttt tatacatgag gtcagtgcta cagccaccta gcatgaacta 1680 

acccagcttc cacctccata aagttaccta gagttgttga gttggaatat gttctggcat 1740 

ttacctgacc tgccaatcat tagggagagg caacaaggta attcagcctt tcctcctatc 1800 

agcacaaaga aactcaaagc tgttttttcc ctttctgttc caaagcagtc ttatcctgac 1860 

aggagcggtc tatactagtg cagatttcaa cacttttttt taacgtttta attactatag 1920 

tgttatgtag agatttgatt gagcagctaa tgtttctgaa ctttacttac taattttcag 1980 

tgtccttaag ggttctgtag tgttatcaaa gcaaaaagaa aatgctgcat aaaaatacca 2040 

aacttcagca actgttaata ctcagatcat atacctctta ataaatagca tcttatgcta 2100 

attagccctg ctaaactatg tacagaggaa actgttcaag tattggattt gaaagtaagt 2160 

gacttatgtt taacagaact aatgatgtat tgaaacactg tattatgaaa agctaaatta 2220

tacatcattg taactatgta gaaagtgtag actaatgtat aatcaaaatg ctaaggattt 2280

ttatatggcc ttgtatgagg ggagtttgaa tgttaataaa catgttttcc actttaagat 2340 

ccagtaaatg tctgttctac tgtagtatta cttaaaa 2377 



<210> 32 
<211> 795 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> SITE 
<222> (791) 

<223> n equals a,t,g, or c 

<400> 32 

ggcacagtgc agcatctacc taatccaggt gatctttggt gctgtggacc tgcctgccaa 
gcttgtgggc ttccttgtca tcaactccct gggtcgccgg cctgcccaga tggctgcact 120 
gctgctggca ggcatctgca tcctgctcaa tggggtgata ccccaggacc agtccattgt 
ccgaacctct cttgctgtgc tggggaaggg ttgtctggct gcctccttca actgcatctt 240 
cctgtatact gggaactgta tcccacaatg atccggcaga caggcatggg aatgggcagc 300 



60 
20 
180 
accatggccc gagtgggcag catcgtgagc ccactggtga gcatgactgc cgagctctac 360

ccctccatgc ctctcttcat ctacggtgct gttcctgtgg ccgccagcgc tgtcactgtc 420 

ctcctgccag agaccctggg ccagccactg ccagacacgg tgcaggacct ggagagcagg 480 

aaagggaaac agacgcgaca gcaacaagag caccagaagt atatggtccc actgcaggcc 540 

tcagcacaag agaagaatgg actctgagga ctgagaaggg gccttacaga accctaaagg 600 

gagggaaggt cctacaggtc tccggccacc cacacaagga ggaggaagag gaaatggtga 660

cccaagtgtg ggggttgtgg ttcaggaaag catcttccca ggggtccacc tccctttata 720

aaccccacca gaaccacatc attaaaaggt ttgactgcgm aaaaaaaaaa aaaaaaaaaa 780 
aactcgaggg ngggc 



<210> 33 

<211> 2656 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (2652) 

<223> n equals a,t,g, or c 



<400> 33 

gatgagtgcc 

tcaggaaaat 

ttgtcaagat 

gaatttcccc 

attaagaggg 

gaaagatatt 

tatgcattgg 

gctagttccc 

cagggagccc 

tcgagatcag 

catccggcac 

cgctggagga 

catcgaacaa 

catgccgttt 

gatcatcgtg 

gaagagctag 

tacataaatg 

ccccttgttg 

caacgatttg 

gaaaagcttt 

gcagcaggag 

aaagaagaaa 

gaaacagttt 

gaagacaccc 

aaccaggcac 

ttgagttttg 

ataaaacctg 

ttcagggaaa 

atagattcct 

ggcttataca 

aaattaaatg 

ggtttgatta 

atggaagaaa 



tagaagctgc 

caggccaaga 

ttgaatttaa 

aagttttata 

aagagatcaa 

taagctatgg 

aatttgcctc 

cacctagtgg 

tatcttcaga 

taatacacaa 

caaggcacat 

cagaaataga 

ttgaattaat 

tagttcacga 

aaagcagatg 

tgagggactc 

ataaggcaca 

gtatagaaac 

aaaaagaact 

tagcgtctca 

acccagaata 

ctattcaaat 

tgcagtcggc 

caccagaaac 

caaagaaaat 

atgaaaggtg 

aagaagtaaa 

caactatgta 

tgctgttcct 

gaggacatga 

agcaagccgc 

tcatgaatga 

aggatatact 



aatgattgaa 

gcattggttt 

tcaggcattg 

tttggacaga 

gagactgaaa 

ttccggtccc 

aagtaaacct 

ttccatacca 

actgccaagc 

accatttact 

aacggaggaa 

aaatgacacc 

gtactctgac 

aggccaagct 

gatgaagtac 

ttttggtggt 

gttcctaata 

attaccaccg 

agaagaatgg 

gaaattgaga 

tctagagcag 

aattaccaag 

aattaagttg 

cgattatcgt 

tattgagaaa 

tcacaacata 

cttggaggaa 

tctcataatt 

catctgtgct 

tgaagaattg 

agaactcttc 

gtttattgtc 

agctgtagaa 



ggagaaattg 

actgaattac 

ggaagaccag 

tacatgcaca 

gattacctca 

aaacgattcc 

gtttgcactt 

tcacagacat 

acatcacctt 

cagtcccgga 

gaactttctg 

agagatttgc 

aaatctatga 

aatgctgggc 

aatgatattg 

tatagaaatg 

caagaggagt 

gatttgagag 

gatgcacaac 

gagtcagaga 

ccatcaagaa 

gcatcacatg 

gaatatgcaa 

ttacatcatg 

acattactag 

atgaaagttg 

tatgaggagt 

gggctagaaa 

tatcagaata 

atatcacatt 

gaatctggag 

ccatttttgc 

gatatgagaa 



agtctttaca 

cacctgtgtt 

aaaaaattca 

gaaacagaga 

cggtattaca 

ccttggtaga 

ctcctgttga 

taccaagcac 

catcagttgc 

tacctccaga 

tgctggaaag 

aggaaagcat 

tacaagttcc 

actactgggc 

ctgtgacaaa 

ccagtgcata 

ttaataaaga 

attttgttga 

ttgcccagaa 

cttctgtgac 

gtgatttctc 

agcatgaaga 

ggttggttaa 

tagtggtcta 

aacaatttgg 

ctcaagccaa 

ggcatcagga 

attttcaaag 

acaaagaact 

atagaagaga 

aggatcgaga 

cattattact 

atcgatggtg 



ttcagagaat 

aacatttgaa 

caacaaatta 

aataacaaga 

acaaaggcta 

tgttcttcag 

cgatattgac 

aacagaacaa 

tgccatttca 

tttgcccatg 

ttgtttacat 

atccagaatc 

ttatcgatta 

atatattttt 

atcatcatgg 

ctgtttaatg 

aactgggcag 

ggaagacaac 

agctttgcag 

aacagcacaa 

aaagcacttg 

taaaagtcct 

gttggcccaa 

ctttatccag 

agatagaaat 

actggaaatg 

ttataggaaa 

agaaagttat 

cttgtctaaa 

atgtttgcta 

agtaaacaat 

ggtggatgaa 

ttcctacctt 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 






2040 
2100 



ggtcaagaaa tggaaccaca cctccaagaa aagctgacag attttttgcc aaaactgctt 
gattgttcta tggagattaa aagtttccat gagccaccga agttaccttc atattccacg 

catgaactct gtgagcgatt tgcccgaatc atgttgtccc tcagtcgaac tcctgctgat 2160 

ggaagataaa ctgcacactt tccctgaaca cactgtataa actcttttta gttcttaacc 2220 

cttgccttcc tgtcacaggg tttgcttgtt gctgctatag tttttaactt ttttttattt 2280 

taataacygc aaargacaaa atgactatac agactttagt cagactgcag acaataaagc 2340

tgaaaatcgc atggcgctca gacattttaa ccggaactga tgtataatca caaatctaat 2400 

tgattttatt atggcaaaac tatgcttttg ccaccttcct gttgcagtat tactttgctt 2460 

ttatcttttc tttctcaaca gctttccatt cagtctggat ccttccatga ctacagccat 2520 

ttaagtgttc agcactgtgt acgatacata atatttggta gcttgtaaat gaaataaaga 2580 

ataaagtttt atttatggct aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaact 2640

cgaggggggg cncaaa 2656



<210> 34 
<211> 2566 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> SITE 

<222> (2553) 

<223> n equals a,t,g, or c 
<400> 34 

gcaaatagca acttcagtac atcataatat aaatagaaaa aaaagatcag tgcttagatt 



gttaatgttt tgtttttatt tgaattattt tactaacttg tttttgtttt taacctgttc 
tcgctcagag tccctctcct ccccgacagg accctattca ggtttcccct tcttaaagtc 



60 
120 
180 

tcccccagtg aggaactctc tcaacaaggg cccactcctg gtgcagtact atagcttttc 240 
atcccacctc agagtccccc gcaaaaagaa acaagtgatc agagtaccag tcagggtacc 
tcctaaaagc ccagcgatgt cccctccatc cagtccaagg tttcactttt tcaccttttc 
tggtcctttc cccaacagct attaatggta ttatccattc aggtctttct tcaccccagg 420

480 



300 
360 



gcatggtatt tcttacaaac tttctacctc ccactggcta atagctttat tcaagtasaa 600 

ttacacgcca taaaatttac tcattttatt tttttatttt tattaagtta ggttgtgttc 660 

aggatttact ctttttaagt ctgcaattca cttttttttt ggtaaattta gagttgtaca 720 

gtcatcacca tcatccaatt ttagcacatt tccatcacct caaaaagatc cctcatgccc 780 

atttgstgct attccacatt ataaccttcc acccctggca accactaatc tactttgtgt 840 

ctgtatarat tggctttttc tgcatatttc atataaaaat ggaacatata atatttggtc 900 

ttaagtattt ttgaaacata taattttgtt gtggaaatag tagttgattt tatctatgtc 960 

tttatcaggc ctttctctgt attgaatttt cacattgtca ataccactca gaaacagtgg 1020 

ttyatcctac tgcagcaagt tcattgaata ctgttggcac tggaatttat ccctgctgta 1080 

accaaaaggt yctycggttt gatcctactc agcttacaaa gggctgtaaa rtgagggacc 1140 

acatggttac mcttcgtgat caaggtgaag gsggagattt gccgtcctgt cccactgcta 1200

gaatgttgga cgatttgcac aagtacagag atgtcattgt tgtgcctttt tcaaaagata 1260

cagttagtga tgttggggtt ggcctctgtg atgaaaaggg tatagaatgt gatgttttac 1320 

tggagccaaa tacaccatgg ggtcccaaaa ctggggagct caatgctttc ttgtcattga 1380

aaaactggac tctacaactg aaacaacagt cactgttttc agaagaagaa gaatatacca 1440 

ctggatctga ggtcactgaa gatgaagttg gagatgaaga agaagtatcc aagaaacaaa 1500 

ggaaaaagga gaagccaaag aagttcacta gacmaccaaa aaagcaggta tcttcaccct 1560

gtgcccagag gaaagaaaag gcattggaga aggtaactct gaattatctg ktgktaaagt 1620 

catatggaaa aataagcatg tgagtatagc cagaaaaaaa taaaaagagt aatgaagaca 1680 

catggaatgc tagcaatgta aaaatgaagt tttttataga ctgagattaa agatctctaa 1740 

gatatattga caaatgagaa aaggaaggtg cagaaacgta tagtggtata gtatgctacc 1800 



^^^^^^^^^^ PCT/US99/05721 
atttgtgtaa
ttctggaagg 
gtgtaaggga 
aaaccaatat 
ttttatgata 
gccaggtagg 
tcaaaaatgt 
gtacattcat 
ttgtatcagc 
actgggatta 
gtatcagtta 
cataggtcat 
actcgagggg 



agtagatggg 
ctacataaga 
cagaagtgag 
tatttattca 
tttaaaaata 
aataagagaa 
gtgataggtt 
ctttgtgtta 
tggtcttgag 
taggcgtgag 
gcctgtttat 
ctataacaca 
gggtcccgta 



ggaaatatat 
actcgataaa 
agtcttttca 
aaaaattaaa 
attcttattt 
tagtaataac 
gagtctgggt 
agtagcacag 
cttttgacct 
ccacaatgcc 
tcacgtaaaa 
tataataaag 
cccaattctc 



aaataacttc 
attggttgcc 
ttatatgtgc 
aatagtcttt 
ctccatgcct 
caccattggc 
taagatccac 
gcatataagt 
cgtgatctgc 
tggccacatt 
gttttccacc 
tacatcattg 
ctnacatgca 



cttgtatatg 
tctcaggaag 
cattatacct 
taaattaaaa 
ttgaaggaag 
taaaagaaaa 
agaattacat 
gggttaattc 
ccgcctcagc 
tatgtatttt 
atgtcttatt 
ctgaaaaaaa 
tcgtat 



cataaaatgt 
ggaactgaac 
tttgaatttt 
ataaatcata 
gggtaaaaaa 
actgtgaatt 
tggacacatt 
taaaaaaaaa 
ctcctaaagt 
tttatattct 
atccatggtc 
aaaaaaaaaa 



1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2566 



<210> 35 
<211> 1668 
<212> DNA 

<213> Homo sapiens 



<400> 35 

aatttcgaac 

ttacaaycta 

attttttaaa 

cccaaggttt 

ttaagrgggc 

gtggaaggag 

ttccagctct 

ataaagcctt 

agtgaatgca 

gttttttttt 

tctagttgtt 

aatctgggtt 

tgaaatcccc 

tccacttgtt 

attgctgttg 

taattttgtt 

aatttttcct 

tctatctatc 

gcctgcagta 

agctaaatgt 

attttctata 

attgctgccg 

aggtactctc 

tgkcatggaa 

gatagcaaga 

gttcaaaaac 

ttagagataa 

taaaataaat 



acccataaaa 

tacatttwaw 

tgatccctac 

tgttccattt 

tgttcattag 

gggcttgttg 

gtggccttgg 

cattagttag 

aggcagctgg 

ttcccttcat 

gttacctaaa 

gtcatggatt 

tttyttaaaa 

tagactaaga 

acatgttaat 

ttaggtcttt 

aacctacagt 

tgaatatgaa 

taggagggac 

ttgttacaaa 

aagctaattc 

ggagactaca 

atttctcaga 

taatgtgact 

actggcagta 

tttccagagt 

ttatttggaa 

actgcattgt 



ttgtaaagaa 

gttttttaac 

cattttttaa 

ctcaattcta 

ggkgcgggct 

tgagcagtgc 

tcaggttacg 

gggcacacac 

tacctttcag 

gttctaagac 

aagsccttgt 

ctttattctt 

tgggcaaagc 

tatgttagag 

actattgctg 

tgtttgaaac 

ttttctctga 

cttctaggat 

aacgtcaact 

tttaagacag 

tgktattttt 

ctgcaaaata 

ataagggggc 

ggtaaataat 

ccaagaaact 

aattaatatg 

tttttatagc 

tgttaaaaaa 



ttgtacagta 

cacttcaaag 

atgatcccta 

gtctgtgaaa 

gggcattatg 

atgagaaaaa 

tctcttcagt 

cgcagtattc 

gtagtagttg 

cagctgagag 

tttaaatttc 

tttttctccc 

aataattcta 

agggaaaggg 

ttgacatgtt 

agtttactgt 

gtacatatgg 

catgtttatt 

actgcatgtg 

tattttaatg 

tgtctctaaa 

gataaagtga 

attcctaaat 

tcattttttc 

atcaagaaag 

taaagccatg 

ttacttcaca 

aaaaaaaaaa 



cattttaaca 

taagtttcag 

ccaaaatgga 

ttgargtctg 

agtgtgtttt 

cggcttggct 

atcgtaactg 

cttaagtcat 

aattcaggta 

gcaaagttgt 

tgtgatacct 

ttaaaaagtt 

catcatttct 

tcgttgtttt 

tactgatggg 

ttttatcagt 

tttcattgtt 

ctagtagatg 

caataacaag 

ccgtttgcat 

ttagggaact 

atgaaatagt 

tttaaaagta 

ttgaatttat 

gagatctgct 

taactaacaa 

atgtgcccag 

aactcgta 



tattkgcttg 

acaccaacac 

aggctggtat 

atgaccactc 

tcatgagkca 

ttgcttcttt 

taatgtggag 

cttgatgaca 

gtattgttca 

accactgagc 

aagaatttca 

acattttaga 

ccccttccct 

agtaaatact 

ctgtgttcca 

tttggtccct 

tgatctactt 

atgacttaaa 

cttgaaggga 

ttttctaaga 

gtccaggttt 

agaaaccaac 

ggkcaactat 

ttatagacct 

actaaacaga 

aggatttgct 

gtcagctgta 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1668 



<210> 36 
<211> 983 
<212> DNA 

<213> Homo sapiens 






wo 99/46289 







gagctgttcg taaagtcgcc cgacagcttt ttctccgtag tatgcgagtt gacaaaacag 

ccagagaaca gggctcccca ttacaatctt ttcgagatct tttcccttgc taaccggatc 



60 



<400> 36 

ccgcccgcct gccggccccg gtccggaatt cccgggtcga cccacgcgtc cggggcaagt 

gagcgagctc cttcctcacc gggctgacta gcctctcctt tccctgtccc cctccatcgc 120 

tgctctgcag gaagccagcc cccagggcca gtcccggags ggctgatccg catctacagc 180 

atgaggttct gcccctattc tcacaggacc cgcctcgtcc tcaaggccaa agacatcaga 240 

catgaagtgg tcaacattaa cctgagaaac aagcctgaat ggtactatac aaagcaccct 300 

tttggccaca ttcctgtcct ggagaccagc caatgtcaac tgatctatga atctgttatt 360 

gcttgtgagt acctggatga tgcttatcca ggaaggaagc tgtttccata tgacccttat 420 

gaacgagctc gccaaaagat gttattggag ctattttgta aggtcccaca tttgaccaag 480 

gagtgcctgg tagcgttgag atgtgggaga gaatgcacta atctgaaggc agccctgcgt 540 

caggaattca gcaacctgga agagattctt gagtatcaga acaccacctt ctttggtgga 600 

acctgtatat ccatgattga ttacctcctc tggccctggt ttgagcggct ggatgtgtat 660 

gggatactgg actgtgtgag ccacackcca gcctgcggct ctggatatca gccatgaagt 720 

gggaccccac agtctgtgct cttctcatgg ataagagcat tttccagggc ttcttgaatc 780 

tctattttca gaacaaccct aatgcctttg actttgggct gtgctgagtc tcactgtcca 840 

ccccttcgct gtccagaatt ccccagcttg ttgggagtct acgtcacggc ttgtcttggg 900 

aaccaatccg tctctctttc ttttctttga agttcccaat aaaatgaaaa caggaaaaaa 960 

aaaaaaaaaa aaagggcggc cgc 983 



<210> 37 
<211> 2351 
<212> DNA 
<213> Homo sapiens 

<400> 37 

ccacgcgtcc ggcagaagca gcagcagcag aagacacagc gccggtccag gaggcggctc 



60 
120 
180 

tgatttgtgc gaaaacatgc cttgcacttg tacctggagg aactggagac agtggattcg 240 

acctttagta gcggtcatct acctggtgtc aatagtggtt gcggttcccc tatgcgtgtg 300 

ggaattacag aaactggagg ttggaataca caccaaggct tggtttattg ctggaatctt 360 

tttgctgtga ctattcctat atcactgtgg gtgatattgc aacacttagt gcattataca 420 

caacctgaac tacaaaaacc aataataagg attctttggg atggtaccta tttacagttt 480 

tagatagttg gatagctttg aaatatcccg gaattgcaat atatgtggat acctgcagag 540 

aatgctatga agcttatgta atttacaact ttatgggatt ccttaccaat tatctaacta 600 

accggtatcc aaatctggta ttaatccttg aagccaaaga tcaacagaaa catttccctc 660 

ctttatgttg ctgtccacca tgggctatgg gagaagtatt gctgtttagg tgcaaactaa 720 

gtgtattaca gtacacagtt gtcagacctt tcaccaccat cgttgcttta atctgtgagc 780 

tgcttggtat atatgacgaa gggaacttta gcttttcaaa tgcttggact tatttggtta 840 

taataaacaa catgtcacag ttgtttgcca tgtattgtct cctgctcttt tataaagtac 900 

taaaagaaga actgagccca atccaacctg ttggcaaatt tctttgtgta aagctggtgg 960 

tttttgtttc tttttgattt ggcgtttacc ttttcctaac atataggcaa gcagtagtta 1020 

ttgctttgtt ggtaaaagtt ggcgttattt ctgaaaagca tacgtgggaa tggcaaactg 1080 

tagaagctgt ggccaccgga ctccaggatt ttattatctg tattgagatg ttcctcgctg 1140 

ccattgctca tcattacaca ttctcatata aaccatatgt ccaagaagca gaagagggct 1200 

catgctttga ttcctttctt gccatgtggg atgtctcaga tattagagat gatatttctg 1260 

aacaagtaag gcatgttgga cggacagtca ggggacatcc caggaaaaaa ttgtttcccg 1320 

aggatcaaga tcaaaatgaa catacaagtt tattatcatc atcatcacaa gatgcaattt 1380 

ccattgcttc ttctatgcca ccttcaccca tgggtcacta ccaagggttt ggacacactg 1440 

tgactcccca gactacacct accacagcta agatatctga tgaaatcctt agtgatacta 1500 

taggagagaa aaaagaacct tcagataaat ccgtggattc ctgaacagta tggaaaagca 1560

aactgtgcaa ctactacatt atatcattac ctggtatccc atggattttg tgcttgggac 1620 

agaccataaa tgatggaaaa tgtcaacaca aaaatagctg aaagccaggt acaactactg 1680 
catttatata
cttgatttct 
tcttcctgat 
gagtcttgct 
ctgcctccca 
gcacctgcca 
ttagccagga 
ggaattacag 
agtacttcct 
agaaatacga 
tatccctatt 
aaaaaaaaaa 



tgtaagtttt 
taacattagg 
gttacactgc 
ctgtcaccca 
agttcatgcc 
ccatgcccag 
tggtcttgat 
gcgtgagcca 
accactgctg 
attagtggaa 
cctcaactga 



gtatatcaaa 
gtatcgcata 
tttatcaaga 
ggctggagtg 
attctcctgc 
ctaatttttt 
ctgacctcgt 
ctgcgcctgg 
catgagtata 
cttaatcatg 
atgtctttca 



aataattggt 
ctcaaatggt 
ggatggactt 
cagtggcgca 
ctcagccctc 
ttttttcagt 
gatccgccga 
ccaagaatgg 
atgctccgga 
tgccatataa 
ataaataaga 



ctaaatttcc 
agacaatgac 
tttttttttt 
atctcgggtc 
ccaagtagct 
agagacaggg 
cctcggcctc 
acatttttta 
attatcagaa 
gcttacctaa 
atttatcatt 



tagacttaga 
cccaactaaa 
gagacagaca 
actgcaagct 
gggactacag 
tctcaccatg 
ccaaagtgct 
aaaaaacatc 
agcataatgc 
caaacagtta 
taaaaaaaaa 



1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2351 



<210> 38 

<211> 1534 

<212> DNA 

<213> Homo sapiens 

<400> 38 

ccacgcgtcc gcccacgcgt ccggaaatac taaaaattaa atgaaaagtt gtgatgcttg 60 

aagtgctgat tagtattcgg actaaagtat atgaatgaat aacaattttt tctctgcaga 120

gactgcagca tgaaatctca tgctacattg actggtggca gtggttttta tttcatagaa 180 

ctttcttttc tgttgttgag atctgtgctg ttggtgctgg ttctgctttg gcagttccca 240 

aagtccctta caggacaaga atgatgagtg gggatataaa tctcaattcc agcagctgct 300 

cactcacagg tgtctcggtg gaagaattgg gtcttgttga gcctgtagct tctctctata 360 

tactgctggg agatgctgcc tgtgagtgcc ttgcttgata tccaggtgct agggctaagg 420

acctctttgt ggaatagcca tctttgcttg aggtctgtgc aattgtgtat gcctgcagtg 480

cagtgcctgg taaggctttc aaactgtggg caagaatgta acaatgcctg tcactcgtga 540 

agagacacag tcggtgaggt gagtatggat tatgccaagg aaagttttct gggtcagaga 600 

ctttatcctg ctgcaggaat taactccatt gatcaaaaac agccttaatt gggatggggc 660 

tcgggggcaa atttcatatg tgattggcag gagtctaaac tgtatagctt ttctggaggg 720

cattttggca gtggggatta aagtgtcaaa tgtgcatact ctgtgactgg acattttcac 780 

ttcacagaat ttatcctaag gaaagcattg tacaagtata cacaaaaggg tgttcctccg 840 

caccataatg tttagtgttg ccatcacctg gggctttatt aaaaaaggaa aagttacata 900 

aattccagta aaaccataca gtggaatatt ataaagctgc tgaagaagat gaagtcaact 960 

tctatgtact attatggaat gatggtaaag aaattatgta caaaagtcac agatcagcat 1020 

gaatagtgtg atcctatttt caatatatat gtgtgtattg agtgcatatt atgtaaaggt 1080 

ttatatgcat taattttggg aggaagaata tcaaatgcta atagcgatca tcaaatgcta 1140 

atagtgatta tgtaaagacc ctcattttct acttcctact tctctgtatt gtttgaactg 1200 

tttataaagg taaaaccata gtaatttggg ctgggtgcgg tagctcatgc ctgtaatccc 1260 

agcactttgg gaggccaagt ggggtggata tcttgaggtc agttgtttaa gatgagcctg 1320 

accaacatgg tgaaaccctg tctctactaa aaatacaaaa attggttggg cttgatggtg 1380 

tgcacctgtg gtcctaacta cttgggaggc tgaggtggga gaattgcttg aacccaggag 1440 

gtggaggtcg cagtgagctg agattgcacc actgcactcc agcctggatg atagagcaag 1500 
attctctctc aaaaaaaata aaaaaaaaaa aaaa 



<210> 39 

<211> 1182 

<212> DNA 

<213> Homo sapiens 



<400> 39 

agattagagt gataattctt gttctttgtg tattcattta tacagccctg ctccatggac 



1534 



60 



tactcatgtt

ctagctgtgg 

tgagagcatt 

tcagcacttt 

tcgctttctc 

gattatctgt 

tgtcaggcac 

aactttcagg 

tttatatttc 

ttaagttcaa 

acataatagt 

taattaaggt 

gtaaactgac 

tcaagttata 

gaggccaagg 

aaaccccatc 

gccgggcatg 

cacttcaacc 

tggaagcctg 



ataataaagg 
ctctacccct 
ttcccccagg 
attattagca 
agtgaagctt 
cattycagag 
ccctgtcatg 
tttaatgctt 
tttgaatttt 
tacttattat 
actaattatg 
tgtaatgtta 
aaacataact 
gagccttagg 
tgggcggatc 
tctactaaac 
gtggtgggca 
caagaggcag 
ggcaatagag 



gatagagaag 
ttttctctca 
gaaaaccttg 
tcagattctt 
tcttcctcat 
tccckgtcct 
taaggcacgt 
gagaacattt 
aggagttgtc 
cctcattctg 
ggccactttg 
tttctggatc 
ccttttcttt 
ccgggtgtgt 
atgaggtcag 
acacacacac 
tctgtggtcc 
aggttgcggt 
caasactcca 



ggcatgatga 
cttaagaaaa 
aattgtgtac 
tgttgaactt 
cgtttccaag 
cccackgagc 
tgggtctgcc 
gaaggctgtt 
taccacaaca 
tggaaaaaat 
gctgaacaca 
tttgaggcat 
ggaaaagatg 
cggctcatgc 
gagtttgaga 
acacacacac 
cggctactcg 
gagccaagat 
tctcaaaaaa 



cgatgtgcgt 

cttcccagaa 

atgtaaatcc 

aatattattc 

ttgttgtgtt 

cacatgcgca 

agagcggcac 

gtctggaaaa 

aataaactag 

atattttcta 

gttttatgct 

tagtagagat 

gatgctgtct 

ctgtggtccc 

tcatcctggc 

acacacacac 

ggaggctgag 

catgccattg 

aa 



tcccagtgtg 
acccgagaag 
atgggaatct 
ttctttattt 
tcggtaayck 
cacacatctc 
cccttgttcc 
gataagtgtt 
atcacacttt 
ttaatcatgt 
taggcttaca 
cactgatgaa 
gctaaactaa 
ggcactttgg 
caacatggtg 
acacacacac 
acaggagtgt 
cactccagcc 



120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1182 



<210> 40 

<211> 1841 

<212> DNA 

<213> Homo sapiens 



<400> 40
cgacccacgc 
atagacatga 
ctttgtattt 
gttttattca 
ctcaataaac 
ccatcacgga 
ccgaggtgtt 
ctgggggcag 
cgagtggtga 
aaaaattttg 
cggtagaata 
ttgtaatcat 
tatcagtttg 
acaatgacct 
tattgacttg 
tcacactcag 
ggtttgtgct 
ggaaacagcc 
cacgcctgta 
ttcgagacca 
ctgggcgtgg 
acttgaaccc 
gggtaacaga 
tgtgttctat 
ttagatatat 
tttcggggct 
tgtaatccca 
cagcctgacc 



gtccgcacct 

tgtcaccatc 

taatcacttt 

ttttttagca 

atttgtctaa 

ctgcccttaa 

agtagcacca 

agatggcaga 

caaacctcca 

cttcatgaaa 

cattgtccca 

ggatgcacac 

tggttgttac 

tctgaactcc 

gatagaagcc 

tgcctggtag 

gttttcatgc 

tcacattggc 

atcccagcac 

gcctggccaa 

tggtgggtgc 

gggagatgga 

gtaagactct 

agatgtgcac 

atttttacat 

ctctgggttt 

gcactttggg 

aacatggtga 



gtcccctctg 

tggaatcatt 

catgcttgcc 

tttggtagca 

ttaaactaaa 

gtctttccct 

cggactcttc 

gggagcacat 

cagggtcaca 

atgcattatg 

cttggttaaa 

cacaagggaa 

ttccaataga 

gttttactga 

aacataaatc 

cctgtgtgtt 

agagatagca 

aagtgtgaaa 

tttgggagga 

catggtgaaa 

ctgtagtctc 

ggttgcagtg 

gtctcaaaaa 

ctgagtgtag 

tccttgtgaa 

tggcattgaa 

aggccgaggt 

aaccccatct 



tgagctctgt 

gtgtacgtct 

atcccttcta 

tttagcacta 

acaggaggtc 

gaacagaaat 

cactttttca 

gggaaccggg 

gatgttggac 

cattactttt 

tgtgataaaa 

aagcacttgt 

aatgcttcga 

ctgtttaaaa 

agacagtgtc 

ggagggattg 

acagtaacac 

ggagccaaaa 

tgaggtgggt 

ccccatctct 

agctactcag 

agccaagatt 

aaaaaaaaaa 

gaagaaattt 

cactggcatt 

aatctcttgg 

gggcagatca 

ctactaaaaa 



actgttctcc 

ctgctactcc 

ttcataatgg 

atctgtccaa 

aggtcatttc 

tagcaaattg 

ccttggcaat 

caaatgtgac 

atgataaatt 

acatgaatag 

ggagattagt 

tccttctgcc 

aagatgaccc 

taatcctgca 

cctgaacaaa 

gcggcagctt 

gactaagtga 

tatggccagg 

ggatcatttg 

actaaaactg 

gaggctgaga 

gcaccactgt 

aaaaaaaggg 

aatatttagg 

aatggatagg 

ctgggtgcga 

tgagttcagg 

taaaaaaaaa 



gtccctgcaa 

tcacatcctg 

cagagtttgt 

ataatgaatg 

acctttttcc 

aagtaaggaa 

gggaaacatc 

taagagacag 

ttgcttcatg 

ctaaattgaa 

ggacttgaat 

tcgtcactag 

aagggctcca 

gcttcagatg 

actgaatact 

ctctgctcct 

ccatggctag 

catggtggct 

aggtcaggag 

caaaaattgg 

caggagaatc 

actccagcct 

ggagccaaac 

gagaaaaatg 

gaaccttggt 

tgctcacgcc 

agttcaagac 

attagccagg 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
catggtggcg ggcgcctgta atcccagcta ctcaggaggc tgaggcagga gaatcacttg 1740
aacccgggag gcggaggttg cagtgagctg agattgcagc attgcatccc agcctgggtg 1800 
acagagcgag actccatctc aaaaaaaaaa aaaaaaaaaa a 1841 



<210> 41 

<211> 1197 

<212> DNA 

<213> Homo sapiens 



<400> 41 

cccacgcgtc 

ttttgtgtga 

cttatttgtt 

aatgcatgtc 

agatgctctt 

gaactcctaa 

gcttttctcc 

cttttcctta 

tgtcatctgc 

ataaatcaga 

ttcataaaag 

ataggcgatt 

caagttactt 

ctcccccagt 

ttaaatgagg 

gattttgata 

actccagacc 

agttacctaa 

gttactttca 

tccagaactt 



cgattgggaa 
tgtcactgtt 
actgtgctta 
cattgttttg 
tcattctgta 
gaccagtgca 
cataacttat 
tttttcatag 
tagcaggcca 
tctggaaacg 
ccaatttatt 
tgtctattat 
tatgtaaatt 
tttctaactg 
taaaatatag 
aatacaccca 
ttcctgtgtt 
gccgtaggca 
tcagaaggct 
caagagaact 



aaggctgtcg 
ccatgttgaa 
gtgatgtgtc 
ctatacatgt 
tattttgcta 
gtgctccaag 
aaagcagaag 
ttattttctt 
ttgttcgagg 
aagttagaga 
atggctactc 
tcactctttg 
acttggagtt 
gctttgattt 
attcggtgag 
tgtaactgcc 
tcagtagctt 
attttatgtg 
caaagtaatg 
ctctcaaaga 



ttaatcactt 
gagtcatgga 
atatttgcag 
tttatttcat 
tggaccacag 
tatctatgaa 
gtagttttct 
attaagttat 
ttaaaataca 
tttttgcaca 
ttaacaattc 
tttttataat 
ttatttactg 
ttgtttctta 
atgcacagat 
agctgaatca 
tgttcacatg 
attctgcata 
gacctgaaaa 
aagaaaagaa 



ttagcagcag 
gatgtacaaa 
caaatacaaa 
ttctgctcca 
accctcattg 
tcaaatggca 
ttcccatcac 
ctgtaaaaat 
aattaagaag 
aacataatac 
ctttaaagtt 
tttgttaggt 
aaaatcagat 
gcttgattgc 
cttgagtgtg 
agataaagac 
ttcggcagca 
gtagtcaata 
gcaggaaaaa 
ggaaaaaaaa 



aaatttttta 
atgtattgac 
aaaagttaag 
caatttcagc 
acatgtattg 
gtgttcacat 
aatagccatt 
aatgcatctc 
aagcaagcaa 
ttacaacagt 
aaaaactact 
ttcttttatt 
ttcatcattt 
ttgctagttt 
cagttcaatt 
cattctcatt 
tgtgagacga 
tggtgataat 
gaaggggtta 
aaaaaaa 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1197 



<210> 42 

<211> 602 

<212> DNA 

<213> Homo sapiens 



<400> 42 
aattcggcac agkttgtgtt tctmatgttc caggtccggc caggctggca gctcctgctg 60
gtcatgtttt cctcatgtgc tgtttccaac cagctcttgg tctggtaccc agcaactgcc 120
ttagcagaca acaaacctgt agcacctgac cgacgaatca gtgggcatgt gggcatcatc 180
ttcagcatgt catacctgga aagcaaggga ttgctggcta cagyttcaga agaccgaagc 240
gttcgtatct ggaaggtggg cgacctgcga gtgcctgggg gtcgggtgca gaatattggg 300
cactgctttg ggcacagcgc ccgtgtgtgg caggtcaagc ttctagagaa ttaccttatc 360
agtgcaggag aggattgtgt ctgcttggtg tggagccatg aaggtgagat cctccaggcc 420
tttcggggac accaggatgt gtacccggtt gtagtaggag ctgaaatcca tgctgagctg 480
taccaggaac ttgcatatct agagacagag actgagtcac tggcccatct ctttgctctt 540
gtccccaggc cagaataaag aatagagtgt aaaaaaaaaa aaaaaaaaaa aaaaaactcg 600
ag 602



<210> 43 
<211> 2492 
<212> DNA 
<213> Homo sapiens
<400> 43 

ccacgcgtcc ggaggaagga tgatgatgaa ggaccgtaca caccattcga caccccctcg 60
ggtaaactgg aaacagtgaa atgggcgttc acctggccgc tgagtttcgt cttatacttc 120
actgtaccca actgcaacaa gccgcgctgg gagaaatggt tcatggtgac gtttgcttcc 180
tccacgctgt ggatcgcagc cttctcctac atgatggtgt ggatggtcac aatcattggt 240
tacaccctgg ggattcctga cgtcatcatg ggggatcacc ttcctggctg ctgggaccag 300
cgtgcctgac tgcatggcca gcctcattgt ggccagacaa gggatggggg acatggctgt 360
gtccaactcc attgggagca acgtgtttga catcctgatt ggcctcggtc tcccctgggc 420
tctgcagacc ctggctgtgg attacggatc ctacatccgg ctgaatagca gggggctgat 480
ctactccgta ggcttgctcc tggcctctgt ttttgtcacg gtgttcggcg tccacctgaa 540
caagtggcag ctggacaaga agctgggctg tgggtgcctc ctcctgtatg gtgtgttcct 600
gtgcttctcc atcatgactg agttcaacgt gttcaccttt gtgaacctgc ccatgtgcgg 660
ggaccactga gccgccgggt gcccacagaa gctcagctcc ttcttttctg tgcaatacga 720
gacccggccg caccccgagt cacacaggcc cctggggcca cggcgttcgt ctctcctgtg 780
ctgtcctcag gcctccgctc ctgttttggt ggcccaggct ctcccctgcc ccatcctcgc 840
tcccccacct ccttgggtca tgcccaccca ccctttcctg cctcctccgt gtgaagacat 900
ccaacatcca cgtgactttt ccagctccat ttttgaacag tgactgagat tctagaaaaa 960
ctggctgcta actggcctga gccaggcaac actgattcca atccctcctc cttttttaag 1020
ttatttgatg gaagactcac ctaatttgtg acctgagact gttgaagaaa tagagaggag 1080
ggggcccgtt gattacagag agcatttggg attttgtttg gtttggagat gatgcctagg 1140
ttactgggtt tggggggatt gttttctttt gggggccttc cccttttact ccttttcttc 1200
cagagatcaa gagcttctct tgcatcttct tccactgggc tctggattaa tcaattaccc 1260
aaaggctgca cctgccgtgt tgtctgggct tgcatcccag atgtgttgga gtatgcatgg 1320
atgtagtgct ttttagagga gccactgggc aaggccacca agaacaaatg catgacattt 1380
tatagccaag gacgcctcac taaagtctta tgggcgtccc ctggggttgg gggggcacaa 1440
ggttttggag gaagaagaca acttccctca ttccatcatc accatctctt tctcactagg 1500
ttctttctag ttttcaaagc aataagtcta gcctgccttg gacaaggggg cccccagtta 1560
aacaaactac ccatccatga ggtgccaggc agtcaaaaaa cagaagcttc cccgattgtg 1620
agtccatgag atgtgctctt gttgtaaggc atttggggtg acagggagtg acccagaggc 1680
caccactgct tttcatgcag gagttacaga cactggtttt cttggaaaat ggagagaagc 1740
gcactttgca cagacgtcgt caattaagtc ccaatttgcc acttggtatt gagtacactg 1800
gaccctgacc actggctttt gggcaaacgt cttcctcacg gggcgcttcc gccaagccgg 1860
cccagctgca cccctccctt cctggaggga tggccaggga aggagaaaac agagaactga 1920
cacttttgaa accacagaat gtgtaacatg cagatcgctc aagggcataa gttattgtga 1980
acgtttttgc caatcactgc tcaacagccc tgctagattt tgtatgatgc tgaattatta 2040
tgcagactaa ttccacccag ttgagacaca ccatgcttgt tcacttgtat ttattgaaac 2100
tgtggattct tgcccgtgct gtcccttgta tttactttaa gcactgatca cttatcattc 2160
attcggtatg gttttccctg tcccttgtac acattctggt atgaatttgt aaaaataccc 2220
tactacaaat tggttgaatg tttctgtctg tggtgcgaac cagcattaac ggatggggca 2280
cgtgcccaac tgaggaacag gagaagaaat ccccaatttg ggctctcaga gctaagacac 2340
acttattgat tctgttgcac attttgcact ggtttatggc gattgttttc ttggacggat 2400
agtgtaaaat aaacttctct gttctctaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 2460
aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aa 2492



<210> 44 
<211> 75 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (75) 
<223> Xaa equals stop translation
<400> 44 

Met Leu His Leu Ala Ala Met Trp Trp Ala Cys Val Thr Thr Leu Val 
1 5 10 15

Phe Thr Leu Val Ser Lys Leu Phe Ile Pro Leu Lys Ser Ser Met Asp
20 25 30

Gly Glu Met Ser Leu Asp Pro His Ser Cys Val Leu Val Cys Ile Cys
35 40 45

Phe Pro Leu Arg Phe Val Phe Val Ser Cys Phe Glu Leu Tyr Leu Val
50 55 60

Gln Ser Ile Val Lys Leu Ser Gln Gln Leu Xaa
65 70 75 



<210> 45 
<211> 78 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> SITE 
<222> (78)

<223> Xaa equals stop translation 



<400> 45 

Met Asp Ala Phe Ala Gly Ser Pro Phe Ser Leu Met Val Pro Lys Cys
1 5 10 15

Val Leu Ile Leu Phe Cys Leu Val Tyr Ser Leu Gln Cys Ile Gln Pro
20 25 30

Tyr Ser Ser Leu Leu Asn Ser Ala Ser Leu Pro Tyr His His Gly Leu
35 40 45

Lys Leu Ala Asn Leu Leu Leu Ile Val Phe Tyr Pro His Ile His Ser
50 55 60

Ile Pro Phe Ser Ser Ser His Pro Ser Lys Leu His Ile Xaa
65 70 75



<210> 46 
<211> 47 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (47) 

<223> Xaa equals stop translation 



<400> 46

Met Asp Leu Leu Gln Val Cys Phe Phe Leu Phe Phe Ser His Leu Trp
1 5 10 15

Ser Trp Thr Glu Gly Lys Leu Pro Cys Asn Phe Pro Gly Pro Val Gly
20 25 30

Arg Val Phe Leu Ser Pro Phe Gln Met Leu Gly Phe Lys Gln Xaa
35 40 45 



<210> 47 

<211> 102 

<212> PRT 

<213> Homo sapiens 

<220> 
<221> SITE 
<222> (102) 

<223> Xaa equals stop translation 
<400> 47 

Met Ala Phe Trp Phe Thr Gly Leu Pro Leu Leu Ser Leu Ile Leu Leu
1 5 10 15

Cys Ile Gly Arg Val Phe Leu Gly Val Gly Glu Ser Phe Ala Ser Thr
20 25 30

Gly Ser Thr Leu Trp Gly Ile Gly Leu Val Gly Pro Leu His Thr Ala
35 40 45

Arg Val Ile Ser Trp Asn Gly Val Ala Thr Tyr Gly Ala Met Ala Ala
50 55 60

Gly Ala Pro Leu Gly Val Tyr Leu Asn Gln His Trp Gly Leu Ala Gly
65 70 75 80

Val Ala Ala Leu Ile Val Leu Ala Val Ala Val Ser Leu Trp Leu Ala
85 90 95 

Ser Ala Asn Pro Thr Xaa 
100 



<210> 48 

<211> 382 

<212> PRT 

<213> Homo sapiens 

<220> 
<221> SITE 
<222> (67) 

<223> Xaa equals any of the naturally occurring L-amino acids
<220>

<221> SITE 
<222> (139) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<220> 

<221> SITE 
<222> (141) 

<223> Xaa equals any of the naturally occurring L-amino acids 

<220> 

<221> SITE 
<222> (165) 

<223> Xaa equals any of the naturally occurring L-amino acids 

<220> 

<221> SITE 
<222> (194) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<220> 

<221> SITE 
<222> (344) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<220> 

<221> SITE 
<222> (361) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<220> 

<221> SITE 
<222> (382) 

<223> Xaa equals stop translation 
<400> 48 

Met Phe Gln Val Arg Pro Gly Trp Gln Leu Leu Leu Val Met Phe Ser
1 5 10 15

Ser Cys Ala Val Ser Asn Gln Leu Leu Val Trp Tyr Pro Ala Thr Ala
20 25 30

Leu Ala Asp Asn Lys Pro Val Ala Pro Asp Arg Arg Ile Ser Gly His
35 40 45

Val Gly Ile Ile Phe Ser Met Ser Tyr Leu Glu Ser Lys Gly Leu Leu
50 55 60

Ala Thr Xaa Ser Glu Asp Arg Ser Val Arg Ile Trp Lys Val Gly Asp
65 70 75 80

Leu Arg Val Pro Gly Gly Arg Val Gln Asn Ile Gly His Cys Phe Gly
85 90 95

His Ser Ala Arg Val Trp Gln Val Lys Leu Leu Glu Asn Tyr Leu Ile
100 105 110

Ser Ala Gly Glu Asp Cys Val Cys Leu Val Trp Ser His Glu Gly Glu 
115 120 125 

Ile Leu Gln Ala Phe Arg Gly His Gln Gly Xaa Gly Xaa Arg Ala Ile
130 135 140

Ala Ala His Glu Arg Gln Ala Trp Val Ile Thr Gly Gly Asp Asp Ser
145 150 155 160 

Arg His Arg Leu Xaa His Leu Val Gly Arg Gly Tyr Arg Gly Leu Gly 
165 170 175 

Val Ser Ala Leu Cys Phe Lys Ser Arg Ser Arg Pro Gly Thr Leu Lys 
180 185 190 

Ala Xaa Thr Leu Ala Gly Ser Trp Arg Leu Leu Ala Val Thr Asp Thr 
195 200 205 

Gly Ala Leu Tyr Leu Tyr Asp Val Glu Val Lys Cys Trp Glu Gln Leu
210 215 220

Leu Glu Asp Lys His Phe Gln Ser Tyr Cys Leu Leu Glu Ala Ala Pro
225 230 235 240 

Gly Pro Glu Gly Phe Gly Leu Cys Ala Met Ala Asn Gly Glu Gly Arg 
245 250 255 

Val Lys Val Val Pro Ile Asn Thr Pro Thr Ala Ala Val Asp Gln Thr
260 265 270 

Leu Phe Pro Gly Lys Val His Ser Leu Ser Trp Ala Leu Arg Gly Tyr 
275 280 285 

Glu Glu Leu Leu Leu Leu Ala Ser Gly Pro Gly Gly Val Val Ala Cys 
290 295 300 

Leu Glu Ile Ser Ala Ala Pro Ser Gly Lys Ala Ile Phe Val Lys Glu
305 310 315 320

Arg Cys Arg Tyr Leu Leu Pro Pro Ser Lys Gln Arg Trp His Thr Cys
325 330 335 

Ser Ala Phe Leu Pro Pro Gly Xaa Phe Leu Val Cys Gly Asp Arg Arg 
340 345 350 

Gly Ser Val Leu Leu Phe Pro Ser Xaa Pro Gly Leu Leu Lys Asp Pro 
355 360 365 

Gly Val Gly Gly Lys Ala Arg Ala Gly Ala Gly Ala Leu Xaa 
370 375 380 



<210> 49 
<211> 46

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (46) 

<223> Xaa equals stop translation 



<400> 49 

Met Gln Lys Lys Lys Leu Val Cys Tyr Leu Met Leu Arg Gln Tyr Phe
1 5 10 15

Phe Leu Val Val Val Ser Leu Pro Trp Pro Cys Val Leu Phe Gln Met
20 25 30

His Tyr Pro Arg Thr Val Thr Pro Thr Leu Thr Glu Tyr Xaa
35 40 45



<210> 50 

<211> 168 

<212> PRT 

<213> Homo sapiens 



<220> 

<221> SITE 
<222> (60) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<220> 

<221> SITE 
<222> (64) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<220> 

<221> SITE 
<222> (132) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<400> 50 

Met Val Thr Phe Ala Ser Ser Thr Leu Trp Ile Ala Ala Phe Ser Tyr
1 5 10 15

Met Met Val Trp Met Val Thr Ile Ile Gly Tyr Thr Leu Gly Ile Pro
20 25 30

Asp Val Ile Met Gly Ile Thr Phe Leu Ala Ala Gly Thr Ser Val Pro
35 40 45

Asp Cys Met Ala Ser Leu Ile Val Ala Arg Gln Xaa Met Gly Asp Xaa
50 55 60

Ala Val Ser Asn Ser Ile Gly Ser Asn Val Phe Asp Ile Leu Ile Gly
65 70 75 80

Leu Gly Leu Pro Trp Ala Leu Gln Thr Leu Ala Val Asp Tyr Gly Ser
85 90 95

Tyr Ile Arg Leu Asn Ser Arg Gly Leu Ile Tyr Ser Val Gly Leu Leu
100 105 110

Leu Ala Ser Val Phe Val Thr Val Phe Gly Val His Leu Asn Lys Trp
115 120 125

Gln Leu Asp Xaa Lys Leu Gly Cys Gly Cys Leu Leu Leu Tyr Gly Val
130 135 140

Phe Leu Cys Phe Ser Ile Met Thr Glu Phe Asn Val Phe Thr Phe Val
145 150 155 160 

Asn Leu Pro Met Cys Gly Asp His 
165 



<210> 51 
<211> 50 
<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (50) 

<223> Xaa equals stop translation 
<400> 51 

Met Thr Ser Val Pro Leu Ala Thr Phe Ser Val Leu Thr Ile Ala Leu
1 5 10 15

Arg Ala Gln Val Leu Lys Leu Val Val Leu Ser Phe Val Ser Ala Phe
20 25 30

Ser Pro Val His Tyr Pro Pro Pro Leu Leu Leu Lys Gln Ser Arg Leu
35 40 45 

Asn Xaa 
50 



<210> 52 
<211> 41 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (41) 

<223> Xaa equals stop translation 



<400> 52 



Met Leu Cys Asp Leu Ile Leu Leu Phe Asn Ile Lys Met Ala Ile Tyr
1 5 10 15

His Leu Ile Ile Leu Gln Phe Phe Cys Ser Val Cys Ser Glu Pro Asp
20 25 30

Thr Ala Leu Ser Ile Ser Pro Leu Xaa
35 40



<210> 53 
<211> 95 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> SITE 
<222> (95) 

<223> Xaa equals stop translation 



<400> 53 
Met Leu Leu Ser Phe Tyr Cys Leu Pro Met Val Ser Ile His Ile Phe
1 5 10 15

Phe Pro Cys Ala His Cys Val Tyr Leu Leu His Ile Ser Cys Ser Leu
20 25 30

Gly Glu Glu Ser Phe Asn Arg Asp Thr Cys Lys Lys Asp Phe Cys Phe
35 40 45

Ser Ile Gln Asn Val Asn Ser Thr Phe Leu Leu Ser Leu Ala Val Phe
50 55 60

Arg Phe Ser Glu Arg Phe Ser Asp Ser Asn Phe Leu Phe Thr Thr Pro
65 70 75 80

Pro Ile Cys Ser Glu Lys Asn Gly Leu Leu Tyr His Trp Ile Xaa
85 90 95



<210> 54 
<211> 485 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (322) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<220> 

<221> SITE 
<222> (345) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<220>

<221> SITE 
<222> (374) 

<223> Xaa equals any of the naturally occurring L-amino acids 



<220> 

<221> SITE 
<222> (485) 

<223> Xaa equals stop translation 



<400> 54 

Met Val Ala Thr Val Cys Gly Leu Leu Val Phe Leu Ser Leu Gly Leu 
1 5 10 15

Val Pro Pro Val Arg Cys Leu Phe Ala Leu Ser Val Pro Thr Leu Gly
20 25 30

Met Glu Gln Gly Arg Arg Leu Leu Leu Ser Tyr Ser Thr Ala Thr Leu
35 40 45

Ala Ile Ala Val Val Pro Asn Val Leu Ala Asn Val Gly Ala Ala Gly
50 55 60

Gln Val Leu Arg Cys Val Thr Glu Gly Ser Leu Glu Ser Leu Leu Asn
65 70 75 80

Thr Thr His Gln Leu His Ala Ala Ser Arg Ala Leu Gly Pro Thr Gly
85 90 95

Gln Ala Gly Ser Arg Gly Leu Thr Phe Glu Ala Gln Asp Asn Gly Ser
100 105 110

Ala Phe Tyr Leu His Met Leu Thr Val Thr Gln Gln Val Leu Glu Asp
115 120 125

Phe Ser Gly Leu Glu Ser Leu Ala Arg Ala Ala Ala Leu Gly Thr Gln
130 135 140

Arg Val Val Thr Gly Leu Phe Met Leu Gly Leu Leu Val Glu Ser Ala
145 150 155 160

Trp Tyr Leu His Cys Tyr Leu Thr Asp Leu Arg Phe Asp Asn Ile Tyr
165 170 175

Ala Thr Gln Gln Leu Thr Gln Arg Leu Ala Gln Ala Gln Ala Thr His
180 185 190

Leu Leu Ala Pro Pro Pro Thr Trp Leu Leu Gln Ala Ala Gln Leu Arg
195 200 205

Leu Ser Gln Glu Glu Leu Leu Ser Cys Leu Leu Arg Leu Gly Leu Leu
210 215 220

Ala Leu Leu Leu Val Ala Thr Ala Val Ala Val Ala Thr Asp His Val
225 230 235 240

Ala Phe Leu Leu Ala Gln Ala Thr Val Asp Trp Ala Gln Lys Leu Pro
245 250 255 

Thr Val Pro Ile Thr Leu Thr Val Lys Tyr Asp Val Ala Tyr Thr Val
260 265 270

Leu Gly Phe Ile Pro Phe Leu Phe Asn Gln Leu Ala Pro Glu Ser Pro
275 280 285

Phe Leu Ser Val His Ser Ser Tyr Gln Trp Glu Leu Arg Leu Thr Ser
290 295 300

Ala Arg Cys Pro Leu Leu Pro Ala Arg Arg Pro Arg Ala Ala Ala Pro
305 310 315 320

Leu Xaa Ala Gly Gly Leu Gln Leu Leu Ala Gly Ser Thr Val Leu Leu
325 330 335

Glu Gly Tyr Ala Arg Arg Leu Arg Xaa Ala Ile Ala Ala Ser Phe Phe
340 345 350

Thr Ala Gln Glu Ala Arg Arg Ile Arg His Leu His Ala Arg Leu Gln
355 360 365

Arg Arg His Asp Arg Xaa Gln Gly Gln Gln Leu Pro Leu Gly Asp Pro
370 375 380

Ser Cys Val Pro Thr Pro Arg Pro Ala Cys Lys Pro Pro Ala Trp Ile
385 390 395 400

Ala Tyr Arg Leu Asp Ala Leu Arg Thr Glu Ser Ser Glu Gly Glu Gly
405 410 415

Lys Glu Leu Trp Ser Cys Arg Asp Leu Ser Cys His Leu Gly Pro Val
420 425 430

Pro Pro Pro Cys Val Thr Leu Gly Lys Ser Leu His Leu Ser Glu Pro
435 440 445

Arg Phe Leu His Leu His Asn Asp Ser Ile Phe Thr Ile Asp Val Thr
450 455 460

Tyr Phe Pro Arg Arg Asp Val Val Arg Met Glu Gly Asn Thr Gly His
465 470 475 480

Asp Arg Pro Gly Xaa 
485 



<210> 55 
<211> 115 
<212> PRT 

<213> Homo sapiens 
<220>

<221> SITE 
<222> (115) 

<223> Xaa equals stop translation 

<400> 55 

Met Pro Ile His Lys Thr Lys Ile Ser Cys Val Phe Leu Leu Leu Ser
1 5 10 15

Leu Lys Trp His Trp Met Thr Asn Gly Lys Leu Asp Ala Ala Leu Asn
20 25 30

Val Pro Leu Gly Phe Arg Gly Phe Gln Ser Gln Trp Thr Gly Gly Gly
35 40 45

Leu Cys Gln Cys Leu Ser Gly Val Cys Leu Cys His Cys Gly Ala Ala
50 55 60

Trp Ala Thr Asp Leu Gly Arg Thr Leu Gly Asp Gly Ala Pro Val Trp
65 70 75 80 

Trp Val Cys Val Gly Ser Ala Val Pro Val His Val Arg Lys Ala Leu 
85 90 95 

Leu Leu Tyr Thr Glu Ser Cys Ser Leu Ser Thr Thr Asp Arg Ser Pro 
100 105 110 

Leu Pro Xaa 
115 



<210> 56 
<211> 50 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (50) 

<223> Xaa equals stop translation 
<400> 56 

Met Ser Arg Ala Pro Cys Ala Ser Ser Ile Leu Val Leu Thr Leu Ile
1 5 10 15

Val Thr Leu Leu Val Leu Leu Cys Ser Val Lys Ile Cys Asn Trp Leu
20 25 30

Arg Ile Thr Val Gly Val His Ser Tyr Ser Thr Lys Ser Pro Gln Val
35 40 45 

Phe Xaa 
50 



<210> 57

<211> 172 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 

<222> (172) 

<223> Xaa equals stop translation 

<400> 57 

Met Lys Lys Cys Leu Leu Pro Val Leu Ile Thr Cys Met Gln Thr Ala
1 5 10 15

Ile Cys Lys Asp Arg Met Met Met Ile Met Ile Leu Leu Val Asn Tyr
20 25 30

Arg Pro Asp Glu Phe Ile Glu Cys Glu Asp Pro Val Asp His Val Gly
35 40 45

Asn Ala Thr Ala Ser Gln Glu Leu Gly Tyr Gly Cys Leu Lys Phe Gly
50 55 60

Gly Gln Ala Tyr Ser Asp Val Glu His Thr Ser Val Gln Cys His Ala
65 70 75 80

Leu Asp Gly Ile Glu Cys Ala Ser Pro Arg Thr Phe Leu Arg Glu Asn
85 90 95

Lys Pro Cys Ile Lys Tyr Thr Gly His Tyr Phe Ile Thr Thr Leu Leu
100 105 110

Tyr Ser Phe Phe Leu Gly Cys Phe Gly Val Asp Arg Phe Cys Leu Gly
115 120 125

His Thr Gly Thr Ala Val Gly Lys Leu Leu Thr Leu Gly Gly Leu Gly
130 135 140

Ile Trp Trp Phe Val Asp Leu Ile Leu Leu Ile Thr Gly Gly Leu Met
145 150 155 160

Pro Ser Asp Gly Ser Asn Trp Cys Thr Val Tyr Xaa 
165 170 



<210> 58 

<211> 125 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (101)

<223> Xaa equals any of the naturally occurring L-amino acids 
<400> 58
Met Leu Ser Gln Pro Arg Met Glu Ser Leu Asp Thr Pro Ala Ala Tyr
1 5 10 15

Ser Leu Gly Leu Ala Leu Leu Gly Leu Gly Val Val Leu Val Leu Ser
20 25 30

Ser Phe Phe Ala Leu Gly Phe Ala Gly Thr Phe Leu Gly Asp Tyr Phe
35 40 45

Gly Ile Leu Lys Glu Ala Arg Val Thr Val Phe Pro Phe Asn Ile Leu
50 55 60

Asp Asn Pro Met Tyr Trp Gly Ser Thr Ala Asn Tyr Leu Gly Trp Ala
65 70 75 80

Ile Met His Ala Ser Pro Thr Gly Leu Leu Leu Thr Val Leu Val Ala
85 90 95

Leu Thr Tyr Ile Xaa Ala Leu Leu Tyr Glu Glu Pro Phe Thr Ala Glu
100 105 110

Ile Tyr Arg Gln Lys Ala Ser Gly Ser His Lys Arg Ser
115 120 125



<210> 59 
<211> 311 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (142) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<220> 

<221> SITE 
<222> (311) 

<223> Xaa equals stop translation 
<400> 59 

Met Leu Leu Trp Leu Leu Gly Trp Leu Glu Cys Val His Asn Ser Arg 
1 5 10 15

Arg Ser Gln Gly Leu Pro Pro His Tyr Asp Asp Val Glu Val Phe Ile
20 25 30

Leu Gln Leu Glu Gly Glu Lys His Trp Arg Leu Tyr His Pro Thr Val
35 40 45

Pro Leu Ala Arg Glu Tyr Ser Val Glu Ala Glu Glu Arg Ile Gly Arg
50 55 60

Pro Val His Glu Phe Met Leu Lys Pro Gly Asp Leu Leu Tyr Phe Pro
65 70 75 80



Arg Gly Thr Ile His Gln Ala Asp Thr Pro Ala Gly Leu Ala His Ser
85 90 95

Thr His Val Thr Ile Ser Thr Tyr Gln Asn Asn Ser Trp Gly Asp Phe
100 105 110

Leu Leu Asp Thr Ile Ser Gly Leu Val Phe Asp Thr Ala Lys Glu Asp
115 120 125

Val Glu Leu Arg Thr Gly Ile Pro Arg Gln Leu Leu Leu Xaa Val Glu
130 135 140

Ser Thr Thr Val Ala Thr Arg Arg Leu Ser Gly Phe Leu Arg Thr Leu
145 150 155 160

Ala Asp Arg Leu Glu Gly Thr Lys Glu Leu Leu Ser Ser Asp Met Lys 
165 170 175 

Lys Asp Phe Ile Met His Arg Leu Pro Pro Tyr Ser Ala Gly Asp Gly
180 185 190

Ala Glu Leu Ser Thr Pro Gly Gly Lys Leu Pro Arg Leu Asp Ser Val
195 200 205

Val Arg Leu Gln Phe Lys Asp His Ile Val Leu Thr Val Leu Pro Asp
210 215 220

Gln Asp Gln Ser Asp Glu Ala Gln Glu Lys Met Val Tyr Ile Tyr His
225 230 235 240 

Ser Leu Lys Asn Ser Arg Glu Thr His Met Met Gly Asn Glu Glu Glu 
245 250 255 

Thr Glu Phe His Gly Leu Arg Phe Pro Leu Ser His Leu Asp Ala Leu 
260 265 270 

Lys Gln Ile Trp Asn Ser Pro Ala Ile Ser Val Lys Asp Leu Lys Leu
275 280 285

Thr Thr Asp Glu Glu Lys Glu Ser Leu Val Leu Ser Leu Trp Thr Glu
290 295 300

Cys Leu Ile Gln Val Val Xaa
305 310 



<210> 60 
<211> 164 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 



<222> (2)

<223> Xaa equals any of the naturally occurring L-amino acids
<220> 

<221> SITE 
<222> (164) 

<223> Xaa equals stop translation 
<400> 60 

Met Xaa Gly Leu Leu Leu Ala Ala Phe Leu Ala Leu Val Ser Val Pro 
1 5 10 15

Arg Ala Gln Ala Val Trp Leu Gly Arg Leu Asp Pro Glu Gln Leu Leu
20 25 30

Gly Pro Trp Tyr Val Leu Ala Val Ala Ser Arg Glu Lys Gly Phe Ala
35 40 45

Met Glu Lys Asp Met Lys Asn Val Val Gly Val Val Val Thr Leu Thr
50 55 60

Pro Glu Asn Asn Leu Arg Thr Leu Ser Ser Gln His Gly Leu Gly Gly
65 70 75 80

Cys Asp Gln Ser Val Met Asp Leu Ile Lys Arg Asn Ser Gly Trp Val
85 90 95

Phe Glu Asn Pro Ser Ile Gly Val Leu Glu Leu Trp Val Leu Ala Thr
100 105 110

Asn Phe Arg Asp Tyr Ala Ile Ile Phe Thr Gln Leu Glu Phe Gly Asp
115 120 125

Glu Pro Phe Asn Thr Val Glu Leu Tyr Ser Leu Thr Glu Thr Ala Ser
130 135 140

Gln Glu Ala Met Gly Leu Phe Thr Lys Trp Ser Arg Ser Leu Gly Phe
145 150 155 160

Leu Ser Gln Xaa



<210> 61 

<211> 240 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (240) 

<223> Xaa equals stop translation 



<400> 61 

Met Arg Ala Leu Arg Arg Leu Ile Gln Gly Arg Ile Leu Leu Leu Thr
1 5 10 15

Ile Cys Ala Ala Gly Ile Gly Gly Thr Phe Gln Phe Gly Tyr Asn Leu
20 25 30

Ser Ile Ile Asn Ala Pro Thr Leu His Ile Gln Glu Phe Thr Asn Glu
35 40 45

Thr Trp Gln Ala Arg Thr Gly Glu Pro Leu Pro Asp His Leu Val Leu
50 55 60

Leu Met Trp Ser Leu Ile Val Ser Leu Tyr Pro Leu Gly Gly Leu Phe
65 70 75 80

Gly Ala Leu Leu Ala Gly Pro Leu Ala Ile Thr Leu Gly Arg Lys Lys
85 90 95

Ser Leu Leu Val Asn Asn Ile Phe Val Val Ser Ala Ala Ile Leu Phe
100 105 110

Gly Phe Ser Arg Lys Ala Gly Ser Phe Glu Met Ile Met Leu Gly Arg
115 120 125

Leu Leu Val Gly Val Asn Ala Gly Val Ser Met Asn Ile Gln Pro Met
130 135 140

Tyr Leu Gly Glu Ser Ala Pro Lys Glu Leu Arg Gly Ala Val Ala Met
145 150 155 160

Ser Ser Ala Ile Phe Thr Ala Leu Gly Ile Val Met Gly Gln Val Val
165 170 175

Gly Leu Ser Thr Thr Ala Ala Pro Gly Leu Arg Gly Leu Gly Arg Gly
180 185 190

Ala Gly Gly Ala Gly Gly Gly Ala Arg Cys Leu Pro Gly Leu Pro Cys
195 200 205

Pro Ala Pro Met Gly Ala Val Pro Ala Ser Gly Pro Glu Glu Thr Gly
210 215 220

Asp Lys Pro Arg Gly Ser Gly Gln Cys His Gly Ala Leu Arg Glu Xaa
225 230 235 240 



<210> 62 
<211> 130 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (130)

<223> Xaa equals stop translation 
<400> 62 

Met Glu Arg Trp Val Asp Asp Ala Phe Trp Ser Phe Leu Phe Ser Leu 
1 5 10 15

Ile Leu Ile Val Ile Met Phe Leu Trp Arg Pro Ser Ala Asn Asn Gln
20 25 30

Arg Tyr Ala Phe Met Pro Leu Ile Asp Asp Ser Asp Asp Glu Ile Glu
35 40 45

Glu Phe Met Val Thr Ser Glu Asn Leu Thr Glu Gly Ile Lys Leu Arg
50 55 60

Ala Ser Lys Ser Val Ser Asn Gly Thr Ala Lys Pro Ala Thr Ser Glu
65 70 75 80

Asn Phe Asp Glu Asp Leu Lys Trp Val Glu Glu Asn Ile Pro Ser Ser
85 90 95

Phe Thr Asp Val Ala Leu Pro Val Leu Val Asp Ser Asp Glu Glu Ile
100 105 110

Met Thr Arg Ser Glu Met Ala Glu Lys Met Phe Ser Ser Glu Lys Ile
115 120 125 



Met Xaa 
130 



<210> 63 

<211> 61 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (61) 

<223> Xaa equals stop translation 
<400> 63 

Met Phe Glu Cys Val Ile Leu Val Ser Phe Leu Val Val Phe Val Val
1 5 10 15

Val Arg Cys Val Gly Leu Ile Pro Thr Gly Gln Ser Lys Glu Phe Gln
20 25 30

His Pro Leu Pro Ala Cys Ser Cys Tyr Pro Thr Asp Gln Thr Leu Asn
35 40 45

Ser Ser Trp Gly Cys Cys Leu Ala Pro His His Asp Xaa
50 55 60



<210> 64

<211> 98 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (98) 

<223> Xaa equals stop translation 
<400> 64 

Met Leu Leu Ser Ile Gly Met Leu Met Leu Ser Ala Thr Gln Val Tyr
1 5 10 15

Thr Ile Leu Thr Val Gln Leu Phe Ala Phe Leu Asn Leu Leu Pro Val
20 25 30

Glu Ala Asp Ile Leu Ala Tyr Asn Phe Glu Asn Ala Ser Gln Thr Phe
35 40 45

Asp Asp Leu Pro Ala Arg Phe Gly Tyr Arg Leu Pro Ala Glu Gly Leu
50 55 60

Lys Gly Phe Leu Ile Asn Ser Lys Pro Glu Asn Ala Cys Glu Pro Ile
65 70 75 80 

Val Pro Pro Pro Val Lys Asp Asn Ser Ser Gly His Phe His Arg Val 
85 90 95 

Asn Xaa 



<210> 65 

<211> 54 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (54) 

<223> Xaa equals stop translation 
<400> 65 

Met Ala Ala Leu Leu Leu Ala Gly Ile Cys Ile Leu Leu Asn Gly Val
1 5 10 15

Ile Pro Gln Asp Gln Ser Ile Val Arg Thr Ser Leu Ala Val Leu Gly
20 25 30

Lys Gly Cys Leu Ala Ala Ser Phe Asn Cys Ile Phe Leu Tyr Thr Gly
35 40 45

Asn Cys Ile Pro Gln Xaa
50



<210> 66 
<211> 64 
<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (64) 

<223> Xaa equals stop translation 

<400> 66 

Met His Trp Asn Leu Pro Gln Val Asn Leu Phe Ala Leu Leu Leu Leu
1 5 10 15

Thr Ile Leu Thr Leu Val Pro His Leu Val Val Pro Tyr His His Arg
20 25 30

His Tyr Gln Ala Gln Gln Asn Asn Arg Glu Pro Tyr Leu Gln Asn Cys
35 40 45

Gln Ala His His Leu His Gln Leu Leu Pro Phe His Arg Asp Gln Xaa
50 55 60 



<210> 67 
<211> 107 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 

<222> (107) 

<223> Xaa equals stop translation 
<400> 67 

Met Phe Cys Phe Tyr Leu Asn Tyr Phe Thr Asn Leu Phe Leu Phe Leu 
1 5 10 15

Thr Cys Ser Arg Ser Glu Ser Leu Ser Ser Pro Thr Gly Pro Tyr Ser
20 25 30

Gly Phe Pro Phe Leu Lys Ser Pro Pro Val Arg Asn Ser Leu Asn Lys
35 40 45

Gly Pro Leu Leu Val Gln Tyr Tyr Ser Phe Ser Ser His Leu Arg Val
50 55 60

Pro Arg Lys Lys Lys Gln Val Ile Arg Val Pro Val Arg Val Pro Pro
65 70 75 80

Lys Ser Pro Ala Met Ser Pro Pro Ser Ser Pro Arg Phe His Phe Phe
85 90 95 

Thr Phe Ser Gly Pro Phe Pro Asn Ser Tyr Xaa 

100 105 



<210> 68 

<211> 45 

<212> PRT 

<213> Homo sapiens 



<220> 

<221> SITE 
<222> (45) 

<223> Xaa equals stop translation 



<400> 68 

Met Arg Lys Thr Ala Trp Leu Cys Phe Phe Phe Gln Leu Cys Gly Leu
1 5 10 15

Gly Gln Val Thr Ser Leu Gln Tyr Arg Asn Cys Asn Val Glu Ile Lys
20 25 30

Pro Ser Leu Val Arg Gly Thr His Arg Ser Ile Pro Xaa
35 40 45



<210> 69 

<211> 43 

<212> PRT 

<213> Homo sapiens 



<220> 

<221> SITE 
<222> (43) 

<223> Xaa equals stop translation 



<400> 69 

Met Asn Leu Leu Leu Leu Val Ser Thr Trp Met Met Leu Ile Gln Glu
1 5 10 15

Gly Ser Cys Phe His Met Thr Leu Met Asn Glu Leu Ala Lys Arg Cys
20 25 30

Tyr Trp Ser Tyr Phe Val Arg Ser His Ile Xaa
35 40



<210> 70 

<211> 58 

<212> PRT 

<213> Homo sapiens 
<220>
<221> SITE 
<222> (58) 

<223> Xaa equals stop translation 
<400> 70 

Met Pro Cys Thr Cys Thr Trp Arg Asn Trp Arg Gln Trp Ile Arg Pro
1 5 10 15

Leu Val Ala Val Ile Tyr Leu Val Ser Ile Val Val Ala Val Pro Leu
20 25 30

Cys Val Trp Glu Leu Gln Lys Leu Glu Val Gly Ile His Thr Lys Ala
35 40 45

Trp Phe Ile Ala Gly Ile Phe Leu Leu Xaa
50 55 



<210> 71 

<211> 45 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (45) 

<223> Xaa equals stop translation 
<400> 71 

Met Lys Ser His Ala Thr Leu Thr Gly Gly Ser Gly Phe Tyr Phe Ile
1 5 10 15

Glu Leu Ser Phe Leu Leu Leu Arg Ser Val Leu Leu Val Leu Val Leu
20 25 30

Leu Trp Gln Phe Pro Lys Ser Leu Thr Gly Gln Glu Xaa
35 40 45 



<210> 72 

<211> 71 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 

<222> (43) 

<223> Xaa equals any of the naturally occurring L-amino acids 

<220> 

<221> SITE 

<222> (44) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<220>

<221> SITE 
<222> (49) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<220> 

<221> SITE 

<222> (52) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<220> 

<221> SITE 
<222> (56) 

<223> Xaa equals any of the naturally occurring L-amino acids 

<220> 

<221> SITE 
<222> (71) 

<223> Xaa equals stop translation 

<400> 72 

Met Gly Ile Phe Ser Thr Leu Leu Leu Ala Ser Asp Ser Leu Leu Asn
1 5 10 15

Leu Ile Leu Phe Phe Phe Ile Phe Ala Phe Ser Val Lys Leu Ser Ser
20 25 30

Ser Ser Phe Pro Ser Cys Cys Val Ser Val Xaa Xaa Leu Ser Val Ile
35 40 45 

Xaa Glu Ser Xaa Ser Ser His Xaa Ala Thr Cys Ala His Thr Ser Leu 
50 55 60 

Ser Gly Thr Pro Val Met Xaa 
65 70 



<210> 73 
<211> 44 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> SITE 
<222> (44) 

<223> Xaa equals stop translation 



<400> 73 
Met Met Ser Pro Ser Gly Ile Ile Val Tyr Val Ser Ala Thr Pro His
1 5 10 15

Ile Leu Leu Cys Ile Leu Ile Thr Phe Met Leu Ala Ile Pro Ser Ile
20 25 30

His Asn Gly Arg Val Cys Val Leu Phe Ile Phe Xaa
35 40



<210> 74

<211> 43 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (43) 

<223> Xaa equals stop translation 
<400> 74 

Met His Val His Cys Phe Ala Ile His Val Leu Phe His Phe Cys Ser
1 5 10 15

Thr Ile Ser Ala Asp Ala Leu Ser Phe Cys Ile Phe Cys Tyr Gly Pro
20 25 30

Gln Thr Leu Ile Asp Met Tyr Trp Asn Ser Xaa
35 40 



<210> 75 

<211> 178 

<212> PRT

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (67) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<220> 

<221> SITE 
<222> (178) 

<223> Xaa equals stop translation 
<400> 75 

Met Phe Gln Val Arg Pro Gly Trp Gln Leu Leu Leu Val Met Phe Ser
1 5 10 15

Ser Cys Ala Val Ser Asn Gln Leu Leu Val Trp Tyr Pro Ala Thr Ala
20 25 30

Leu Ala Asp Asn Lys Pro Val Ala Pro Asp Arg Arg Ile Ser Gly His
35 40 45

Val Gly Ile Ile Phe Ser Met Ser Tyr Leu Glu Ser Lys Gly Leu Leu
50 55 60

Ala Thr Xaa Ser Glu Asp Arg Ser Val Arg Ile Trp Lys Val Gly Asp
65 70 75 80

Leu Arg Val Pro Gly Gly Arg Val Gln Asn Ile Gly His Cys Phe Gly
85 90 95

His Ser Ala Arg Val Trp Gln Val Lys Leu Leu Glu Asn Tyr Leu Ile
100 105 110

Ser Ala Gly Glu Asp Cys Val Cys Leu Val Trp Ser His Glu Gly Glu
115 120 125

Ile Leu Gln Ala Phe Arg Gly His Gln Asp Val Tyr Pro Val Val Val
130 135 140

Gly Ala Glu Ile His Ala Glu Leu Tyr Gln Glu Leu Ala Tyr Leu Glu
145 150 155 160

Thr Glu Thr Glu Ser Leu Ala His Leu Phe Ala Leu Val Pro Arg Pro
165 170 175

Glu Xaa



<210> 76 

<211> 49 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 

<222> (49) 

<223> Xaa equals stop translation 

<400> 76 

Met Val Thr Phe Ala Ser Ser Thr Leu Trp Ile Ala Ala Phe Ser Tyr
1 5 10 15

Met Met Val Trp Met Val Thr Ile Ile Gly Tyr Thr Leu Gly Ile Pro
20 25 30

Asp Val Ile Met Gly Asp His Leu Pro Gly Cys Trp Asp Gln Arg Ala
35 40 45 

Xaa 



<210> 77 

<211> 14 

<212> PRT 

<213> Homo sapiens 



<400> 77 

Asn Tyr Phe Pro Val His Thr Val Gln Pro Asn Trp Tyr Val
1 5 10



<210> 78
<211> 31 

<212> PRT 

<213> Homo sapiens 
<400> 78 

Pro Val Phe Thr Val Asn Phe Leu Ala Trp Val His Ala Pro Pro Val 
1 5 10 15

Ser Ile Thr Val Asp Leu Ile Pro Thr Leu Ala Gln Ala Trp Ser
20 25 30 



<210> 79 
<211> 33 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (19) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<400> 79 

Trp Ile Gln Arg Ile Arg Thr Ser Ala Asp Gln Leu Gly Pro Lys Lys
1 5 10 15

Val Val Xaa Phe Gly Leu Ala Cys Cys Gly Val Ser Gly Leu Phe Tyr 
20 25 30 

Ala 



<210> 80 
<211> 351 
<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (78) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<220> 

<221> SITE 
<222> (326) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<400> 80 

Pro Pro Gly Leu Cys Ala Ala Ile Pro Leu Gln Thr Arg Ser Ala Gln
1 5 10 15

Gly Pro Trp Gly Gly Arg Gln Gly Ser Gly Trp Cys Trp Gly Thr Val
20 25 30



Val Gly Ser Gly Ser Ser Gly Gly Gly Asn Ala Phe Thr Gly Leu Gly 
35 40 45 

Pro Val Ser Thr Leu Pro Ser Leu His Gly Lys Gln Gly Val Thr Ser
50 55 60

Val Thr Cys His Gly Gly Tyr Val Tyr Thr Thr Gly Arg Xaa Gly Ala
65 70 75 80

Tyr Tyr Gln Leu Phe Val Arg Asp Gly Gln Leu Gln Pro Val Leu Arg
85 90 95

Gln Lys Ser Cys Arg Gly Met Asn Trp Leu Ala Gly Leu Arg Ile Val
100 105 110

Pro Asp Gly Ser Met Val Ile Leu Gly Phe His Ala Asn Glu Phe Val
115 120 125

Val Trp Asn Pro Arg Ser His Glu Lys Leu His Ile Val Asn Cys Gly
130 135 140 

Gly Gly His Arg Ser Trp Ala Phe Ser Asp Thr Glu Ala Ala Met Ala 
145 150 155 160 

Phe Ala Tyr Leu Lys Asp Gly Asp Val Met Leu Tyr Arg Ala Leu Gly
165 170 175

Gly Cys Thr Arg Pro His Val Ile Leu Arg Glu Gly Leu His Gly Arg
180 185 190

Glu Ile Thr Cys Val Lys Arg Val Gly Thr Ile Thr Leu Gly Pro Glu
195 200 205

Tyr Gly Val Pro Ser Phe Met Gln Pro Asp Asp Leu Glu Pro Gly Ser
210 215 220

Glu Gly Pro Asp Leu Thr Asp Ile Val Ile Thr Cys Ser Glu Asp Thr
225 230 235 240

Thr Val Cys Val Leu Ala Leu Pro Thr Thr Thr Gly Ser Ala His Ala
245 250 255

Leu Thr Ala Val Cys Asn His Ile Ser Ser Val Arg Ala Val Ala Val
260 265 270

Trp Gly Ile Gly Thr Pro Gly Gly Pro Gln Asp Pro Gln Pro Gly Leu
275 280 285

Thr Ala His Val Val Ser Ala Gly Gly Arg Ala Glu Met His Cys Phe
290 295 300

Ser Ile Met Val Thr Pro Asp Pro Ser Thr Pro Ser Arg Leu Ala Cys
305 310 315 320 






His Val Met His Leu Xaa Ser His Arg Leu Asp Glu Tyr Trp Asp Arg 
325 330 335 

Gln Arg Asn Arg His Arg Met Val Lys Val Asp Pro Glu Thr Arg 
340 345 350 



<210> 81 
<211> 38 
<212> PRT 

<213> Homo sapiens 

<400> 81 

Pro Pro Gly Leu Cys Ala Ala Ile Pro Leu Gln Thr Arg Ser Ala Gln 
1 5 10 15 

Gly Pro Trp Gly Gly Arg Gln Gly Ser Gly Trp Cys Trp Gly Thr Val 
20 25 30 

Val Gly Ser Gly Ser Ser 
35 



<210> 82 
<211> 40 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (40) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<400> 82 

Gly Gly Gly Asn Ala Phe Thr Gly Leu Gly Pro Val Ser Thr Leu Pro 
1 5 10 15 

Ser Leu His Gly Lys Gln Gly Val Thr Ser Val Thr Cys His Gly Gly 
20 25 30 

Tyr Val Tyr Thr Thr Gly Arg Xaa 
35 40 



<210> 83 
<211> 40 
<212> PRT 

<213> Homo sapiens 



<400> 83 

Gly Ala Tyr Tyr Gln Leu Phe Val Arg Asp Gly Gln Leu Gln Pro Val 
1 5 10 15 

Leu Arg Gln Lys Ser Cys Arg Gly Met Asn Trp Leu Ala Gly Leu Arg 
20 25 30 

Ile Val Pro Asp Gly Ser Met Val 
35 40 



<210> 84 

<211> 41 

<212> PRT 

<213> Homo sapiens 

<400> 84 

Ile Leu Gly Phe His Ala Asn Glu Phe Val Val Trp Asn Pro Arg Ser 
1 5 10 15 

His Glu Lys Leu His Ile Val Asn Cys Gly Gly Gly His Arg Ser Trp 
20 25 30 

Ala Phe Ser Asp Thr Glu Ala Ala Met 
35 40 



<210> 85 

<211> 42 

<212> PRT 

<213> Homo sapiens 

<400> 85 

Ala Phe Ala Tyr Leu Lys Asp Gly Asp Val Met Leu Tyr Arg Ala Leu 
1 5 10 15 

Gly Gly Cys Thr Arg Pro His Val Ile Leu Arg Glu Gly Leu His Gly 
20 25 30 

Arg Glu Ile Thr Cys Val Lys Arg Val Gly 
35 40 



<210> 86 

<211> 43 

<212> PRT 

<213> Homo sapiens 



<400> 86 
Thr Ile Thr Leu Gly Pro Glu Tyr Gly Val Pro Ser Phe Met Gln Pro 
1 5 10 15 

Asp Asp Leu Glu Pro Gly Ser Glu Gly Pro Asp Leu Thr Asp Ile Val 
20 25 30 

Ile Thr Cys Ser Glu Asp Thr Thr Val Cys Val 
35 40 



<210> 87 
<211> 41 
<212> PRT 

<213> Homo sapiens 

<400> 87 

Leu Ala Leu Pro Thr Thr Thr Gly Ser Ala His Ala Leu Thr Ala Val 
1 5 10 15 

Cys Asn His Ile Ser Ser Val Arg Ala Val Ala Val Trp Gly Ile Gly 
20 25 30 

Thr Pro Gly Gly Pro Gln Asp Pro Gln 
35 40 



<210> 88 

<211> 40 

<212> PRT 

<213> Homo sapiens 

<400> 88 

Pro Gly Leu Thr Ala His Val Val Ser Ala Gly Gly Arg Ala Glu Met 
1 5 10 15 

His Cys Phe Ser Ile Met Val Thr Pro Asp Pro Ser Thr Pro Ser Arg 
20 25 30 

Leu Ala Cys His Val Met His Leu 
35 40 



<210> 89 

<211> 26 
<212> PRT 

<213> Homo sapiens 

<220> 
<221> SITE 
<222> (1) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<400> 89 

Xaa Ser His Arg Leu Asp Glu Tyr Trp Asp Arg Gln Arg Asn Arg His 
1 5 10 15 

Arg Met Val Lys Val Asp Pro Glu Thr Arg 
20 25 



<210> 90 

<211> 88 

<212> PRT 

<213> Homo sapiens 



<400> 90 
Leu Met Ser Leu Leu Thr Ser Pro His Gln Pro Pro Pro Pro Pro Pro 
1 5 10 15 

Ala Ser Ala Ser Pro Ser Ala Val Pro Asn Gly Pro Gln Ser Pro Lys 
20 25 30 

Gln Gln Lys Glu Pro Leu Ser His Arg Phe Asn Glu Phe Met Thr Ser 
35 40 45 

Lys Pro Lys Ile His Cys Phe Arg Ser Leu Lys Arg Gly Val Ser Ser 
50 55 60 

Ala Pro Glu Ser Cys Leu Ser Gly Val Leu Trp Leu His Val Trp Phe 
65 70 75 80 

Cys Ile Thr Asn Phe Val Cys Glu 
85 



<210> 91 
<211> 53 
<212> PRT 

<213> Homo sapiens 
<400> 91 

Phe Gln Asn Ala Lys Glu Glu Ala Ser Val Leu Pro Tyr Val Glu Thr 
1 5 10 15 

Val Phe Leu Phe Gly Gly Gly Ile Phe Ala Met Ala Leu Cys Leu Ile 
20 25 30 

Ser Asp Ala Leu Ser Ser Tyr Arg Asp Ser His Thr Asn Arg Val Leu 
35 40 45 

Thr Ser Pro Pro Phe 
50 



<210> 92 
<211> 45 
<212> PRT 

<213> Homo sapiens 



<400> 92 

Arg Leu Met Pro Phe Pro Pro Ser Ser Pro Arg Leu Leu Val Thr Leu 
1 5 10 15 

Ala Gly Arg Glu Asp Val Val Gly His Ser Cys Asn Thr Leu Ser Ala 
20 25 30 

His Leu Leu Glu Ile Val Thr Met Leu Ile Thr Trp Phe 
35 40 45 



<210> 93 
<211> 51 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (3) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<400> 93 

Gly Gly Xaa Asp Asp Asp Glu Gly Pro Tyr Thr Pro Phe Asp Thr Pro 
1 5 10 15 

Ser Gly Lys Leu Glu Thr Val Lys Trp Ala Phe Thr Trp Pro Leu Ser 
20 25 30 

Phe Val Leu Tyr Phe Thr Val Pro Asn Cys Asn Lys Pro Arg Trp Glu 
35 40 45 

Lys Trp Phe 
50 



<210> 94 

<211> 115 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (99) 

<223> Xaa equals any of the naturally occurring L-amino acids 



<400> 94 

Gly Gly Pro Arg Met Lys Arg Ser Gly Asn Pro Gly Ala Glu Val Thr 
1 5 10 15 

Asn Ser Ser Val Ala Gly Pro Asp Cys Cys Gly Gly Leu Gly Asn Ile 
20 25 30 



Asp Phe Arg Gln Ala Asp Phe Cys Val Met Thr Arg Leu Leu Gly Tyr 
35 40 45 

Val Asp Pro Leu Asp Pro Ser Phe Val Ala Ala Val Ile Thr Ile Thr 
50 55 60 

Phe Asn Pro Leu Tyr Trp Asn Val Val Ala Arg Trp Glu His Lys Thr 
65 70 75 80 

Arg Lys Leu Ser Arg Ala Phe Gly Ser Pro Tyr Leu Ala Cys Tyr Ser 
85 90 95 

Leu Ser Xaa Thr Ile Leu Leu Leu Asn Phe Leu Arg Ser His Cys Phe 
100 105 110 

Thr Gln Ala 
115 



<210> 95 

<211> 51 

<212> PRT 

<213> Homo sapiens 



<400> 95 

Gly Gly Pro Arg Met Lys Arg Ser Gly Asn Pro Gly Ala Glu Val Thr 
1 5 10 15 

Asn Ser Ser Val Ala Gly Pro Asp Cys Cys Gly Gly Leu Gly Asn Ile 
20 25 30 

Asp Phe Arg Gln Ala Asp Phe Cys Val Met Thr Arg Leu Leu Gly Tyr 
35 40 45 

Val Asp Pro 
50 



<210> 96 

<211> 64 

<212> PRT 

<213> Homo sapiens 



<220> 

<221> SITE 
<222> (48) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<400> 96 

Leu Asp Pro Ser Phe Val Ala Ala Val Ile Thr Ile Thr Phe Asn Pro 
1 5 10 15 

Leu Tyr Trp Asn Val Val Ala Arg Trp Glu His Lys Thr Arg Lys Leu 
20 25 30 

Ser Arg Ala Phe Gly Ser Pro Tyr Leu Ala Cys Tyr Ser Leu Ser Xaa 
35 40 45 

Thr Ile Leu Leu Leu Asn Phe Leu Arg Ser His Cys Phe Thr Gln Ala 
50 55 60 



<210> 97 

<211> 253 

<212> PRT 

<213> Homo sapiens 
<400> 97 

Pro Gln Arg Ser Glu Leu Ala Ala Ala Ser Asn Arg Pro Cys Arg Val 
1 5 10 15 

Cys Ile Ser Leu Leu Leu Cys Leu Glu Asp Arg Thr Met Pro Lys Lys 
20 25 30 

Ala Lys Pro Thr Gly Ser Gly Lys Glu Glu Gly Pro Ala Pro Cys Lys 
35 40 45 

Gln Met Lys Leu Glu Ala Ala Gly Gly Pro Ser Ala Leu Asn Phe Asp 
50 55 60 

Ser Pro Ser Ser Leu Phe Glu Ser Leu Ile Ser Pro Ile Lys Thr Glu 
65 70 75 80 

Thr Phe Phe Lys Glu Phe Trp Glu Gln Lys Pro Leu Leu Ile Gln Arg 
85 90 95 

Asp Asp Pro Ala Leu Ala Thr Tyr Tyr Gly Ser Leu Phe Lys Leu Thr 
100 105 110 

Asp Leu Lys Ser Leu Cys Ser Arg Gly Met Tyr Tyr Gly Arg Asp Val 
115 120 125 

Asn Val Cys Arg Cys Val Asn Gly Lys Lys Lys Val Leu Asn Lys Asp 
130 135 140 

Gly Lys Ala His Phe Leu Gln Leu Arg Lys Asp Phe Asp Gln Lys Arg 
145 150 155 160 

Ala Thr Ile Gln Phe His Gln Pro Gln Arg Phe Lys Asp Glu Leu Trp 
165 170 175 

Arg Ile Gln Glu Lys Leu Glu Cys Tyr Phe Gly Ser Leu Val Gly Ser 
180 185 190 

Asn Val Tyr Ile Thr Pro Ala Asp Leu Arg Ala Cys Arg Pro Ile Met 
195 200 205 

Met Met Ser Arg Phe Ser Ser Cys Ser Trp Arg Glu Arg Asn Thr Gly 
210 215 220 

Ala Ser Thr Thr Pro Leu Cys Pro Trp His Glu Ser Thr Ala Trp Arg 
225 230 235 240 

Pro Arg Lys Gly Ser Ala Gly Arg Cys Met Ser Leu Cys 
245 250 



<210> 98 

<211> 44 

<212> PRT 

<213> Homo sapiens 
<400> 98 

Pro Gln Arg Ser Glu Leu Ala Ala Ala Ser Asn Arg Pro Cys Arg Val 
1 5 10 15 

Cys Ile Ser Leu Leu Leu Cys Leu Glu Asp Arg Thr Met Pro Lys Lys 
20 25 30 

Ala Lys Pro Thr Gly Ser Gly Lys Glu Glu Gly Pro 
35 40 



<210> 99 
<211> 45 
<212> PRT 

<213> Homo sapiens 



<400> 99 
Ala Pro Cys Lys Gln Met Lys Leu Glu Ala Ala Gly Gly Pro Ser Ala 
1 5 10 15 

Leu Asn Phe Asp Ser Pro Ser Ser Leu Phe Glu Ser Leu Ile Ser Pro 
20 25 30 

Ile Lys Thr Glu Thr Phe Phe Lys Glu Phe Trp Glu Gln 
35 40 45 



<210> 100 
<211> 44 
<212> PRT 

<213> Homo sapiens 



<400> 100 
Lys Pro Leu Leu Ile Gln Arg Asp Asp Pro Ala Leu Ala Thr Tyr Tyr 
1 5 10 15 

Gly Ser Leu Phe Lys Leu Thr Asp Leu Lys Ser Leu Cys Ser Arg Gly 
20 25 30 

Met Tyr Tyr Gly Arg Asp Val Asn Val Cys Arg Cys 
35 40 



<210> 101 
<211> 45 
<212> PRT 

<213> Homo sapiens 



<400> 101 

Val Asn Gly Lys Lys Lys Val Leu Asn Lys Asp Gly Lys Ala His Phe 
1 5 10 15 

Leu Gln Leu Arg Lys Asp Phe Asp Gln Lys Arg Ala Thr Ile Gln Phe 
20 25 30 

His Gln Pro Gln Arg Phe Lys Asp Glu Leu Trp Arg Ile 
35 40 45 



<210> 102 
<211> 44 
<212> PRT 

<213> Homo sapiens 
<400> 102 

Gln Glu Lys Leu Glu Cys Tyr Phe Gly Ser Leu Val Gly Ser Asn Val 
1 5 10 15 

Tyr Ile Thr Pro Ala Asp Leu Arg Ala Cys Arg Pro Ile Met Met Met 
20 25 30 

Ser Arg Phe Ser Ser Cys Ser Trp Arg Glu Arg Asn 
35 40 



<210> 103 
<211> 31 
<212> PRT 

<213> Homo sapiens 
<400> 103 

Thr Gly Ala Ser Thr Thr Pro Leu Cys Pro Trp His Glu Ser Thr Ala 
1 5 10 15 

Trp Arg Pro Arg Lys Gly Ser Ala Gly Arg Cys Met Ser Leu Cys 
20 25 30 



<210> 104 
<211> 53 
<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (53) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<400> 104 

Gly Gly Gly Ile His Arg Leu His Asn Gly Ala Leu Gln Leu Arg Val 
1 5 10 15 

Leu Gln Arg Val Glu His Leu His Leu Leu His His Ala Val Lys His 
20 25 30 

Ile Cys Thr Ala Ser Leu Pro Val Leu His Gly Phe Ile Ala Ala Gln 
35 40 45 



Cys Arg Pro Gly Xaa 
50 



<210> 105 
<211> 151 
<212> PRT 

<213> Homo sapiens 



<400> 105 

Trp Asp Arg Trp Ser Asp Ser Ala Leu Arg Arg Leu Arg Gly Ser Gly 
1 5 10 15 

Asp Leu Ala Gly Glu Leu Glu Glu Leu Glu Glu Glu Arg Ala Ala Cys 
20 25 30 

Gln Gly Cys Arg Ala Arg Arg Pro Trp Glu Leu Phe Gln His Arg Ala 
35 40 45 

Leu Arg Arg Gln Val Thr Ser Leu Val Val Leu Gly Ser Ala Met Glu 
50 55 60 

Leu Cys Gly Asn Asp Ser Val Tyr Ala Tyr Ala Ser Ser Val Phe Arg 
65 70 75 80 

Lys Ala Gly Val Pro Glu Ala Lys Ile Gln Tyr Ala Ile Ile Gly Thr 
85 90 95 

Gly Ser Cys Glu Leu Leu Thr Ala Val Val Ser Val Ser Leu Glu Gly 
100 105 110 

Ala Leu Pro Pro Pro Ala Leu Trp Gly Gly Thr Pro Arg Ser Ser Ala 
115 120 125 

Leu Asn Gln Phe Thr Leu Gln Lys Lys Lys Lys Lys Lys Lys Lys Lys 
130 135 140 

Lys Lys Lys Lys Lys Lys Lys 
145 150 



<210> 106 
<211> 37 
<212> PRT 

<213> Homo sapiens 



<400> 106 

Arg Arg Leu Arg Gly Ser Gly Asp Leu Ala Gly Glu Leu Glu Glu Leu 
1 5 10 15 

Glu Glu Glu Arg Ala Ala Cys Gln Gly Cys Arg Ala Arg Arg Pro Trp 
20 25 30 

Glu Leu Phe Gln His 
35 



<210> 107 

<211> 29 

<212> PRT 

<213> Homo sapiens 

<400> 107 

Arg Gln Val Thr Ser Leu Val Val Leu Gly Ser Ala Met Glu Leu Cys 
1 5 10 15 

Gly Asn Asp Ser Val Tyr Ala Tyr Ala Ser Ser Val Phe 
20 25 



<210> 108 

<211> 34 

<212> PRT 

<213> Homo sapiens 

<400> 108 

Thr Gly Ser Cys Glu Leu Leu Thr Ala Val Val Ser Val Ser Leu Glu 
1 5 10 15 

Gly Ala Leu Pro Pro Pro Ala Leu Trp Gly Gly Thr Pro Arg Ser Ser 
20 25 30 

Ala Leu 



<210> 109 

<211> 49 

<212> PRT 

<213> Homo sapiens 

<400> 109 

His Glu Leu Arg Leu Arg Lys Asn Thr Val Lys Phe Ser Leu Tyr Arg 
1 5 10 15 

His Phe Lys Asn Thr Leu Ile Phe Ala Val Leu Ala Ser Ile Val Phe 
20 25 30 

Met Gly Trp Thr Thr Lys Thr Phe Arg Ile Ala Lys Cys Gln Ser Asp 
35 40 45 

Trp 



<210> 110 

<211> 24 

<212> PRT 

<213> Homo sapiens 



<400> 110 

Trp Ile Pro Arg Ala Ala Gly Ile Arg His Glu Glu Ser Ile Ala Gln 
1 5 10 15 



Arg Ser Tyr Phe Arg Thr Leu Leu 
20 



<210> 111 

<211> 104 

<212> PRT 

<213> Homo sapiens 

<400> 111 

Ala Asp Thr Asn Phe Thr Gln Glu Thr Ala Met Thr Met Ile Thr Pro 
1 5 10 15 

Ser Ser Lys Leu Thr Leu Thr Lys Gly Asn Lys Ser Trp Ser Ser Thr 
20 25 30 

Ala Val Ala Ala Ala Leu Glu Leu Val Asp Pro Pro Gly Cys Arg Asn 
35 40 45 

Ser Ala Arg Gly Ile Asn Cys Ser Ala Phe Leu Leu Pro Tyr Ser Ser 
50 55 60 

His Val Trp Val Pro Leu Ser Gly Val Val Pro Leu Cys Gln Arg Asn 
65 70 75 80 

Gln Gly His Thr Val Trp Val Gln Ile Ile Tyr Ser Arg Ser Ser Phe 
85 90 95 

Thr Asp Val Phe Ile Ser Thr Arg 
100 



<210> 112 

<211> 26 

<212> PRT 

<213> Homo sapiens 

<400> 112 

Met Thr Met Ile Thr Pro Ser Ser Lys Leu Thr Leu Thr Lys Gly Asn 
1 5 10 15 

Lys Ser Trp Ser Ser Thr Ala Val Ala Ala 
20 25 



<210> 113 

<211> 20 

<212> PRT 

<213> Homo sapiens 



<400> 113 

Arg Gly Ile Asn Cys Ser Ala Phe Leu Leu Pro Tyr Ser Ser His Val 
1 5 10 15 

Trp Val Pro Leu 
20 



<210> 114 

<211> 24 

<212> PRT 

<213> Homo sapiens 

<400> 114 

Val Val Pro Leu Cys Gln Arg Asn Gln Gly His Thr Val Trp Val Gln 
1 5 10 15 

Ile Ile Tyr Ser Arg Ser Ser Phe 
20 



<210> 115 

<211> 26 

<212> PRT 

<213> Homo sapiens 

<400> 115 

Asn Phe Asp Ile Lys Val Leu Asn Ala Gln Arg Ala Gly Tyr Lys Ala 
1 5 10 15 

Ala Ile Val His Asn Val Asp Ser Asp Asp 
20 25 



<210> 116 

<211> 28 

<212> PRT 

<213> Homo sapiens 

<400> 116 

Val Leu Lys Lys Ile Asp Ile Pro Ser Val Phe Ile Gly Glu Ser Ser 
1 5 10 15 

Ala Asn Ser Leu Lys Asp Glu Phe Thr Tyr Glu Lys 
20 25 



<210> 117 

<211> 30 

<212> PRT 

<213> Homo sapiens 

<400> 117 

Pro Glu Phe Ser Leu Pro Leu Glu Tyr Tyr Leu Ile Pro Phe Leu Ile 
1 5 10 15 

Ile Val Gly Ile Cys Leu Ile Leu Ile Val Ile Phe Met Ile 
20 25 30 



<210> 118 
<211> 34 

<212> PRT 

<213> Homo sapiens 

<400> 118 

Thr Lys Phe Val Gln Asp Arg His Arg Ala Arg Arg Asn Arg Leu Arg 
1 5 10 15 

Lys Asp Gln Leu Lys Lys Leu Pro Val His Lys Phe Lys Lys Gly Asp 
20 25 30 

Glu Tyr 



<210> 119 

<211> 27 

<212> PRT 

<213> Homo sapiens 

<400> 119 

Glu Asp Gly Asp Lys Leu Arg Ile Leu Pro Cys Ser His Ala Tyr His 
1 5 10 15 

Cys Lys Cys Val Asp Pro Trp Leu Thr Lys Thr 
20 25 



<210> 120 

<211> 24 

<212> PRT 

<213> Homo sapiens 

<400> 120 

Val Val Pro Ser Gln Gly Asp Ser Asp Ser Asp Thr Asp Ser Ser Gln 
1 5 10 15 

Glu Glu Asn Glu Val Thr Glu His 
20 



<210> 121 

<211> 29 

<212> PRT 

<213> Homo sapiens 

<400> 121 

Gln Ser Phe Gly Ala Leu Ser Glu Ser Arg Ser His Gln Asn Met Thr 
1 5 10 15 

Glu Ser Ser Asp Tyr Glu Glu Asp Asp Asn Glu Asp Thr 
20 25 



<210> 122 

<211> 259 
<212> PRT 

<213> Homo sapiens 
<400> 122 

Ile Arg Arg Leu Asp Cys Asn Phe Asp Ile Lys Val Leu Asn Ala Gln 
1 5 10 15 

Arg Ala Gly Tyr Lys Ala Ala Ile Val His Asn Val Asp Ser Asp Asp 
20 25 30 

Leu Ile Ser Met Gly Ser Asn Asp Ile Glu Val Leu Lys Lys Ile Asp 
35 40 45 

Ile Pro Ser Val Phe Ile Gly Glu Ser Ser Ala Asn Ser Leu Lys Asp 
50 55 60 

Glu Phe Thr Tyr Glu Lys Gly Gly His Leu Ile Leu Val Pro Glu Phe 
65 70 75 80 

Ser Leu Pro Leu Glu Tyr Tyr Leu Ile Pro Phe Leu Ile Ile Val Gly 
85 90 95 

Ile Cys Leu Ile Leu Ile Val Ile Phe Met Ile Thr Lys Phe Val Gln 
100 105 110 

Asp Arg His Arg Ala Arg Arg Asn Arg Leu Arg Lys Asp Gln Leu Lys 
115 120 125 

Lys Leu Pro Val His Lys Phe Lys Lys Gly Asp Glu Tyr Asp Val Cys 
130 135 140 

Ala Ile Cys Leu Asp Glu Tyr Glu Asp Gly Asp Lys Leu Arg Ile Leu 
145 150 155 160 

Pro Cys Ser His Ala Tyr His Cys Lys Cys Val Asp Pro Trp Leu Thr 
165 170 175 

Lys Thr Lys Lys Thr Cys Pro Val Cys Lys Gln Lys Val Val Pro Ser 
180 185 190 

Gln Gly Asp Ser Asp Ser Asp Thr Asp Ser Ser Gln Glu Glu Asn Glu 
195 200 205 

Val Thr Glu His Thr Pro Leu Leu Arg Pro Leu Ala Ser Val Ser Ala 
210 215 220 

Gln Ser Phe Gly Ala Leu Ser Glu Ser Arg Ser His Gln Asn Met Thr 
225 230 235 240 

Glu Ser Ser Asp Tyr Glu Glu Asp Asp Asn Glu Asp Thr Asp Ser Ser 
245 250 255 

Asp Ala Glu 



<210> 123 

<211> 36 

<212> PRT 

<213> Homo sapiens 

<400> 123 

Ala Gln Cys Ser Ile Tyr Leu Ile Gln Val Ile Phe Gly Ala Val Asp 
1 5 10 15 

Leu Pro Ala Lys Leu Val Gly Phe Leu Val Ile Asn Ser Leu Gly Arg 
20 25 30 

Arg Pro Ala Gln 
35 



<210> 124 
<211> 188 
<212> PRT 

<213> Homo sapiens 
<400> 124 

Gly Thr Val Gln His Leu Pro Asn Pro Gly Asp Leu Trp Cys Cys Gly 
1 5 10 15 

Pro Ala Cys Gln Ala Cys Gly Leu Pro Cys His Gln Leu Pro Gly Ser 
20 25 30 

Pro Ala Cys Pro Asp Gly Cys Thr Ala Ala Gly Arg His Leu His Pro 
35 40 45 

Ala Gln Trp Gly Asp Thr Pro Gly Pro Val His Cys Pro Asn Leu Ser 
50 55 60 

Cys Cys Ala Gly Glu Gly Leu Ser Gly Cys Leu Leu Gln Leu His Leu 
65 70 75 80 

Pro Val Tyr Trp Glu Leu Tyr Pro Thr Met Ile Arg Gln Thr Gly Met 
85 90 95 

Gly Met Gly Ser Thr Met Ala Arg Val Gly Ser Ile Val Ser Pro Leu 
100 105 110 

Val Ser Met Thr Ala Glu Leu Tyr Pro Ser Met Pro Leu Phe Ile Tyr 
115 120 125 

Gly Ala Val Pro Val Ala Ala Ser Ala Val Thr Val Leu Leu Pro Glu 
130 135 140 

Thr Leu Gly Gln Pro Leu Pro Asp Thr Val Gln Asp Leu Glu Ser Arg 
145 150 155 160 

Lys Gly Lys Gln Thr Arg Gln Gln Gln Glu His Gln Lys Tyr Met Val 
165 170 175 

Pro Leu Gln Ala Ser Ala Gln Glu Lys Asn Gly Leu 
180 185 



<210> 125 
<211> 23 
<212> PRT 

<213> Homo sapiens 
<400> 125 

Leu Pro Asn Pro Gly Asp Leu Trp Cys Cys Gly Pro Ala Cys Gln Ala 
1 5 10 15 

Cys Gly Leu Pro Cys His Gln 

20 



<210> 126 
<211> 26 
<212> PRT 

<213> Homo sapiens 
<400> 126 

Gly Cys Thr Ala Ala Gly Arg His Leu His Pro Ala Gln Trp Gly Asp 
1 5 10 15 

Thr Pro Gly Pro Val His Cys Pro Asn Leu 
20 25 



<210> 127 
<211> 22 
<212> PRT 

<213> Homo sapiens 
<400> 127 

Leu His Leu Pro Val Tyr Trp Glu Leu Tyr Pro Thr Met Ile Arg Gln 
1 5 10 15 

Thr Gly Met Gly Met Gly 
20 



<210> 128 
<211> 23 
<212> PRT 

<213> Homo sapiens 



<400> 128 

Leu Val Ser Met Thr Ala Glu Leu Tyr Pro Ser Met Pro Leu Phe Ile 
1 5 10 15 



Tyr Gly Ala Val Pro Val Ala 
20 



<210> 129 

<211> 27 

<212> PRT 

<213> Homo sapiens 

<400> 129 

Pro Asp Thr Val Gln Asp Leu Glu Ser Arg Lys Gly Lys Gln Thr Arg 
1 5 10 15 

Gln Gln Gln Glu His Gln Lys Tyr Met Val Pro 
20 25 



<210> 130 

<211> 720 

<212> PRT 

<213> Homo sapiens 



<400> 130 

Cys Leu Glu Ala Ala Met Ile Glu Gly Glu Ile Glu Ser Leu His Ser 
1 5 10 15 

Glu Asn Ser Gly Lys Ser Gly Gln Glu His Trp Phe Thr Glu Leu Pro 
20 25 30 

Pro Val Leu Thr Phe Glu Leu Ser Arg Phe Glu Phe Asn Gln Ala Leu 
35 40 45 

Gly Arg Pro Glu Lys Ile His Asn Lys Leu Glu Phe Pro Gln Val Leu 
50 55 60 

Tyr Leu Asp Arg Tyr Met His Arg Asn Arg Glu Ile Thr Arg Ile Lys 
65 70 75 80 

Arg Glu Glu Ile Lys Arg Leu Lys Asp Tyr Leu Thr Val Leu Gln Gln 
85 90 95 

Arg Leu Glu Arg Tyr Leu Ser Tyr Gly Ser Gly Pro Lys Arg Phe Pro 
100 105 110 

Leu Val Asp Val Leu Gln Tyr Ala Leu Glu Phe Ala Ser Ser Lys Pro 
115 120 125 

Val Cys Thr Ser Pro Val Asp Asp Ile Asp Ala Ser Ser Pro Pro Ser 
130 135 140 

Gly Ser Ile Pro Ser Gln Thr Leu Pro Ser Thr Thr Glu Gln Gln Gly 
145 150 155 160 

Ala Leu Ser Ser Glu Leu Pro Ser Thr Ser Pro Ser Ser Val Ala Ala 
165 170 175 

Ile Ser Ser Arg Ser Val Ile His Lys Pro Phe Thr Gln Ser Arg Ile 
180 185 190 

Pro Pro Asp Leu Pro Met His Pro Ala Pro Arg His Ile Thr Glu Glu 
195 200 205 

Glu Leu Ser Val Leu Glu Ser Cys Leu His Arg Trp Arg Thr Glu Ile 
210 215 220 

Glu Asn Asp Thr Arg Asp Leu Gln Glu Ser Ile Ser Arg Ile His Arg 
225 230 235 240 

Thr Ile Glu Leu Met Tyr Ser Asp Lys Ser Met Ile Gln Val Pro Tyr 
245 250 255 

Arg Leu His Ala Val Leu Val His Glu Gly Gln Ala Asn Ala Gly His 
260 265 270 

Tyr Trp Ala Tyr Ile Phe Asp His Arg Glu Ser Arg Trp Met Lys Tyr 
275 280 285 

Asn Asp Ile Ala Val Thr Lys Ser Ser Trp Glu Glu Leu Val Arg Asp 
290 295 300 

Ser Phe Gly Gly Tyr Arg Asn Ala Ser Ala Tyr Cys Leu Met Tyr Ile 
305 310 315 320 

Asn Asp Lys Ala Gln Phe Leu Ile Gln Glu Glu Phe Asn Lys Glu Thr 
325 330 335 

Gly Gln Pro Leu Val Gly Ile Glu Thr Leu Pro Pro Asp Leu Arg Asp 
340 345 350 

Phe Val Glu Glu Asp Asn Gln Arg Phe Glu Lys Glu Leu Glu Glu Trp 
355 360 365 

Asp Ala Gln Leu Ala Gln Lys Ala Leu Gln Glu Lys Leu Leu Ala Ser 
370 375 380 

Gln Lys Leu Arg Glu Ser Glu Thr Ser Val Thr Thr Ala Gln Ala Ala 
385 390 395 400 

Gly Asp Pro Glu Tyr Leu Glu Gln Pro Ser Arg Ser Asp Phe Ser Lys 
405 410 415 

His Leu Lys Glu Glu Thr Ile Gln Ile Ile Thr Lys Ala Ser His Glu 
420 425 430 

His Glu Asp Lys Ser Pro Glu Thr Val Leu Gln Ser Ala Ile Lys Leu 
435 440 445 

Glu Tyr Ala Arg Leu Val Lys Leu Ala Gln Glu Asp Thr Pro Pro Glu 
450 455 460 



Thr Asp Tyr Arg Leu His His Val Val Val Tyr Phe Ile Gln Asn Gln 
465 470 475 480 

Ala Pro Lys Lys Ile Ile Glu Lys Thr Leu Leu Glu Gln Phe Gly Asp 
485 490 495 

Arg Asn Leu Ser Phe Asp Glu Arg Cys His Asn Ile Met Lys Val Ala 
500 505 510 

Gln Ala Lys Leu Glu Met Ile Lys Pro Glu Glu Val Asn Leu Glu Glu 
515 520 525 

Tyr Glu Glu Trp His Gln Asp Tyr Arg Lys Phe Arg Glu Thr Thr Met 
530 535 540 

Tyr Leu Ile Ile Gly Leu Glu Asn Phe Gln Arg Glu Ser Tyr Ile Asp 
545 550 555 560 

Ser Leu Leu Phe Leu Ile Cys Ala Tyr Gln Asn Asn Lys Glu Leu Leu 
565 570 575 

Ser Lys Gly Leu Tyr Arg Gly His Asp Glu Glu Leu Ile Ser His Tyr 
580 585 590 

Arg Arg Glu Cys Leu Leu Lys Leu Asn Glu Gln Ala Ala Glu Leu Phe 
595 600 605 

Glu Ser Gly Glu Asp Arg Glu Val Asn Asn Gly Leu Ile Ile Met Asn 
610 615 620 

Glu Phe Ile Val Pro Phe Leu Pro Leu Leu Leu Val Asp Glu Met Glu 
625 630 635 640 

Glu Lys Asp Ile Leu Ala Val Glu Asp Met Arg Asn Arg Trp Cys Ser 
645 650 655 

Tyr Leu Gly Gln Glu Met Glu Pro His Leu Gln Glu Lys Leu Thr Asp 
660 665 670 

Phe Leu Pro Lys Leu Leu Asp Cys Ser Met Glu Ile Lys Ser Phe His 
675 680 685 

Glu Pro Pro Lys Leu Pro Ser Tyr Ser Thr His Glu Leu Cys Glu Arg 
690 695 700 

Phe Ala Arg Ile Met Leu Ser Leu Ser Arg Thr Pro Ala Asp Gly Arg 
705 710 715 720 



<210> 131 
<211> 24 
<212> PRT 

<213> Homo sapiens 
<400> 131 

Met Ile Glu Gly Glu Ile Glu Ser Leu His Ser Glu Asn Ser Gly Lys 
1 5 10 15 

Ser Gly Gln Glu His Trp Phe Thr 
20 



<210> 132 

<211> 25 
<212> PRT 

<213> Homo sapiens 
<400> 132 

Phe Glu Leu Ser Arg Phe Glu Phe Asn Gln Ala Leu Gly Arg Pro Glu 
1 5 10 15 

Lys Ile His Asn Lys Leu Glu Phe Pro 
20 25 



<210> 133 
<211> 26 
<212> PRT 

<213> Homo sapiens 
<400> 133 

Glu Ile Thr Arg Ile Lys Arg Glu Glu Ile Lys Arg Leu Lys Asp Tyr 
1 5 10 15 

Leu Thr Val Leu Gln Gln Arg Leu Glu Arg 
20 25 



<210> 134 
<211> 27 
<212> PRT 

<213> Homo sapiens 
<400> 134 

Pro Lys Arg Phe Pro Leu Val Asp Val Leu Gln Tyr Ala Leu Glu Phe 
1 5 10 15 

Ala Ser Ser Lys Pro Val Cys Thr Ser Pro Val 
20 25 



<210> 135 

<211> 26 

<212> PRT 

<213> Homo sapiens 
<400> 135 

Ile Pro Ser Gln Thr Leu Pro Ser Thr Thr Glu Gln Gln Gly Ala Leu 
1 5 10 15 

Ser Ser Glu Leu Pro Ser Thr Ser Pro Ser 
20 25 



<210> 136 
<211> 24 
<212> PRT 

<213> Homo sapiens 
<400> 136 

Ser Val Ile His Lys Pro Phe Thr Gln Ser Arg Ile Pro Pro Asp Leu 
1 5 10 15 

Pro Met His Pro Ala Pro Arg His 
20 



<210> 137 
<211> 23 
<212> PRT 

<213> Homo sapiens 
<400> 137 

Cys Leu His Arg Trp Arg Thr Glu Ile Glu Asn Asp Thr Arg Asp Leu 
1 5 10 15 

Gln Glu Ser Ile Ser Arg Ile 
20 



<210> 138 
<211> 28 
<212> PRT 

<213> Homo sapiens 



<400> 138 

Lys Ser Met Ile Gln Val Pro Tyr Arg Leu His Ala Val Leu Val His 
1 5 10 15 

Glu Gly Gln Ala Asn Ala Gly His Tyr Trp Ala Tyr 
20 25 



<210> 139 
<211> 29 
<212> PRT 

<213> Homo sapiens 



<400> 139 

Arg Trp Met Lys Tyr Asn Asp Ile Ala Val Thr Lys Ser Ser Trp Glu 
1 5 10 15 

Glu Leu Val Arg Asp Ser Phe Gly Gly Tyr Arg Asn Ala 
20 25 



<210> 140 

<211> 24 

<212> PRT 

<213> Homo sapiens 

<400> 140 

Ile Asn Asp Lys Ala Gln Phe Leu Ile Gln Glu Glu Phe Asn Lys Glu 
1 5 10 15 

Thr Gly Gln Pro Leu Val Gly Ile 
20 



<210> 141 

<211> 23 

<212> PRT 

<213> Homo sapiens 

<400> 141 

Met Ile Gln Val Pro Tyr Arg Leu His Ala Val Leu Val His Glu Gly 
1 5 10 15 

Gln Ala Asn Ala Gly His Tyr 
20 



<210> 142 

<211> 26 

<212> PRT 

<213> Homo sapiens 

<400> 142 

Asp Asn Gln Arg Phe Glu Lys Glu Leu Glu Glu Trp Asp Ala Gln Leu 
1 5 10 15 

Ala Gln Lys Ala Leu Gln Glu Lys Leu Leu 
20 25 



<210> 143 

<211> 23 

<212> PRT 

<213> Homo sapiens 

<400> 143 

Ser Glu Thr Ser Val Thr Thr Ala Gln Ala Ala Gly Asp Pro Glu Tyr 
1 5 10 15 

Leu Glu Gln Pro Ser Arg Ser 
20 



<210> 144 

<211> 28 

<212> PRT 

<213> Homo sapiens 



<400> 144 

Gln Ile Ile Thr Lys Ala Ser His Glu His Glu Asp Lys Ser Pro Glu 
1 5 10 15 

Thr Val Leu Gln Ser Ala Ile Lys Leu Glu Tyr Ala 
20 25 



<210> 145 

<211> 28 

<212> PRT 

<213> Homo sapiens 



<400> 145 

Leu Ala Gln Glu Asp Thr Pro Pro Glu Thr Asp Tyr Arg Leu His His 
1 5 10 15 

Val Val Val Tyr Phe Ile Gln Asn Gln Ala Pro Lys 
20 25 



<210> 146 

<211> 29 

<212> PRT 

<213> Homo sapiens 



<400> 146 

Gly Asp Arg Asn Leu Ser Phe Asp Glu Arg Cys His Asn Ile Met Lys 
1 5 10 15 

Val Ala Gln Ala Lys Leu Glu Met Ile Lys Pro Glu Glu 
20 25 



<210> 147 

<211> 26 

<212> PRT 

<213> Homo sapiens 



<400> 147 



Glu Glu Trp His Gln Asp Tyr Arg Lys Phe Arg Glu Thr Thr Met Tyr 
1 5 10 15 

Leu Ile Ile Gly Leu Glu Asn Phe Gln Arg 
20 25 



<210> 148 
<211> 29 
<212> PRT 

<213> Homo sapiens 
<400> 148 

Ile Cys Ala Tyr Gln Asn Asn Lys Glu Leu Leu Ser Lys Gly Leu Tyr 
1 5 10 15 

Arg Gly His Asp Glu Glu Leu Ile Ser His Tyr Arg Arg 
20 25 



<210> 149 
<211> 28 
<212> PRT 

<213> Homo sapiens 

<400> 149 

Cys Leu Leu Lys Leu Asn Glu Gln Ala Ala Glu Leu Phe Glu Ser Gly 
1 5 10 15 

Glu Asp Arg Glu Val Asn Asn Gly Leu Ile Ile Met 
20 25 



<210> 150 
<211> 31 
<212> PRT 

<213> Homo sapiens 
<400> 150 

Val Asp Glu Met Glu Glu Lys Asp Ile Leu Ala Val Glu Asp Met Arg 
1 5 10 15 

Asn Arg Trp Cys Ser Tyr Leu Gly Gln Glu Met Glu Pro His Leu 
20 25 30 



<210> 151 
<211> 25 

<212> PRT 

<213> Homo sapiens 
<400> 151 

Gln Glu Lys Leu Thr Asp Phe Leu Pro Lys Leu Leu Asp Cys Ser Met 
1 5 10 15 

Glu Ile Lys Ser Phe His Glu Pro Pro 
20 25 



<210> 152 
<211> 21 
<212> PRT 
<213> Homo sapiens 
<400> 152 

Gln Ile Ala Thr Ser Val His His Asn Ile Asn Arg Lys Lys Arg Ser 
1 5 10 15 

Val Leu Arg Leu Leu 
20 



<210> 153 

<211> 32 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (10) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<220> 

<221> SITE 
<222> (22) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<400> 153 

Pro Leu Leu Arg Gly Leu Phe Ile Arg Xaa Arg Ala Gly His Tyr Glu 
1 5 10 15 

Cys Val Phe His Glu Xaa Val Glu Gly Gly Ala Cys Cys Glu Gln Cys 
20 25 30 



<210> 154 

<211> 44 

<212> PRT 

<213> Homo sapiens 

<400> 154 

Leu Val Asn Asn Ser Phe Phe Leu Glu Phe Ile Tyr Arg Pro Asp Ser 
1 5 10 15 

Lys Asn Trp Gln Tyr Gln Glu Thr Ile Lys Lys Gly Asp Leu Leu Leu 
20 25 30 

Asn Arg Val Gln Lys Leu Ser Arg Val Ile Asn Met 
35 40 



<210> 155 
<211> 34 
<212> PRT 
<213> Homo sapiens 
<400> 155 

Ile Arg Glu Leu Ser Arg Phe Ile Ala Ala Gly Arg Leu His Cys Lys 
1 5 10 15 

Ile Asp Lys Val Asn Glu Ile Val Glu Thr Asn Arg Tyr Ser His Phe 
20 25 30 

Ser Glu 



<210> 156 
<211> 195 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (11) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<400> 156 

Gly Ser Gln Pro Pro Gly Pro Val Pro Glu Xaa Leu Ile Arg Ile Tyr 
1 5 10 15 

Ser Met Arg Phe Cys Pro Tyr Ser His Arg Thr Arg Leu Val Leu Lys 
20 25 30 

Ala Lys Asp Ile Arg His Glu Val Val Asn Ile Asn Leu Arg Asn Lys 
35 40 45 

Pro Glu Trp Tyr Tyr Thr Lys His Pro Phe Gly His Ile Pro Val Leu 
50 55 60 

Glu Thr Ser Gln Cys Gln Leu Ile Tyr Glu Ser Val Ile Ala Cys Glu 
65 70 75 80 

Tyr Leu Asp Asp Ala Tyr Pro Gly Arg Lys Leu Phe Pro Tyr Asp Pro 
85 90 95 

Tyr Glu Arg Ala Arg Gln Lys Met Leu Leu Glu Leu Phe Cys Lys Val 
100 105 110 

Pro His Leu Thr Lys Glu Cys Leu Val Ala Leu Arg Cys Gly Arg Glu 
115 120 125 

Cys Thr Asn Leu Lys Ala Ala Leu Arg Gln Glu Phe Ser Asn Leu Glu 
130 135 140 

Glu Ile Leu Glu Tyr Gln Asn Thr Thr Phe Phe Gly Gly Thr Cys Ile 
145 150 155 160 

Ser Met Ile Asp Tyr Leu Leu Trp Pro Trp Phe Glu Arg Leu Asp Val 
165 170 175 

Tyr Gly Ile Leu Asp Cys Val Ser His Thr Pro Ala Cys Gly Ser Gly 
180 185 190 

Tyr Gln Pro 
195 



<210> 157 
<211> 14 
<212> PRT 

<213> Homo sapiens 



<400> 157 

Leu Ala Ser Pro Phe Pro Val Pro Leu His Arg Cys Ser Ala 
1 5 10 



<210> 158 
<211> 29 
<212> PRT 

<213> Homo sapiens 



<400> 158 

Met Arg Phe Cys Pro Tyr Ser His Arg Thr Arg Leu Val Leu Lys Ala 
1 5 10 15 

Lys Asp Ile Arg His Glu Val Val Asn Ile Asn Leu Arg 
20 25 



<210> 159 
<211> 24 
<212> PRT 

<213> Homo sapiens 
<400> 159 

Asn Lys Pro Glu Trp Tyr Tyr Thr Lys His Pro Phe Gly His Ile Pro 
1 5 10 15 

Val Leu Glu Thr Ser Gln Cys Gln 
20 



<210> 160 
<211> 24 
<212> PRT 

<213> Homo sapiens 



<400> 160 

Lys Leu Phe Pro Tyr Asp Pro Tyr Glu Arg Ala Arg Gln Lys Met Leu 
1 5 10 15 

Leu Glu Leu Phe Cys Lys Val Pro 
20 



<210> 161 

<211> 25 

<212> PRT 

<213> Homo sapiens 

<400> 161 

Val Ala Leu Arg Cys Gly Arg Glu Cys Thr Asn Leu Lys Ala Ala Leu 
1 5 10 15 

Arg Gln Glu Phe Ser Asn Leu Glu Glu 
20 25 



<210> 162 

<211> 24 

<212> PRT 

<213> Homo sapiens 

<400> 162 

Ser Met Ile Asp Tyr Leu Leu Trp Pro Trp Phe Glu Arg Leu Asp Val 
1 5 10 15 

Tyr Gly Ile Leu Asp Cys Val Ser 
20 



<210> 163 

<211> 60 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (15) 

<223> Xaa equals any of the naturally occurring L-amino acids 
<400> 163 

Ala Ala Gly Cys Val Trp Asp Thr Gly Leu Cys Glu Pro His Xaa Ser 
1 5 10 15 

Leu Arg Leu Trp Ile Ser Ala Met Lys Trp Asp Pro Thr Val Cys Ala 
20 25 30 

Leu Leu Met Asp Lys Ser Ile Phe Gln Gly Phe Leu Asn Leu Tyr Phe 
35 40 45 

Gln Asn Asn Pro Asn Ala Phe Asp Phe Gly Leu Cys 
50 55 60 



<210> 164 
<211> 180 
<212> PRT 

<213> Homo sapiens 

<400> 164 

Val Tyr Leu Phe Leu Thr Tyr Arg Gln Ala Val Val Ile Ala Leu Leu 
1 5 10 15 

Val Lys Val Gly Val Ile Ser Glu Lys His Thr Trp Glu Trp Gln Thr 
20 25 30 

Val Glu Ala Val Ala Thr Gly Leu Gln Asp Phe Ile Ile Cys Ile Glu 
35 40 45 

Met Phe Leu Ala Ala Ile Ala His His Tyr Thr Phe Ser Tyr Lys Pro 
50 55 60 

Tyr Val Gln Glu Ala Glu Glu Gly Ser Cys Phe Asp Ser Phe Leu Ala 
65 70 75 80 

Met Trp Asp Val Ser Asp Ile Arg Asp Asp Ile Ser Glu Gln Val Arg 
85 90 95 

His Val Gly Arg Thr Val Arg Gly His Pro Arg Lys Lys Leu Phe Pro 
100 105 110 

Glu Asp Gln Asp Gln Asn Glu His Thr Ser Leu Leu Ser Ser Ser Ser 
115 120 125 

Gln Asp Ala Ile Ser Ile Ala Ser Ser Met Pro Pro Ser Pro Met Gly 
130 135 140 

His Tyr Gln Gly Phe Gly His Thr Val Thr Pro Gln Thr Thr Pro Thr 
145 150 155 160 

Thr Ala Lys Ile Ser Asp Glu Ile Leu Ser Asp Thr Ile Gly Glu Lys 
165 170 175 



Lys Glu Pro Ser 
180 



<210> 165 
<211> 176 
<212> PRT 
<213> Homo sapiens 

<400> 165 



Thr Asn Asn Lys Asp Ser Leu Gly Trp Tyr Leu Phe Thr Val Leu Asp
1 5 10 15

Ser Trp Ile Ala Leu Lys Tyr Pro Gly Ile Ala Ile Tyr Val Asp Thr
20 25 30

Cys Arg Glu Cys Tyr Glu Ala Tyr Val Ile Tyr Asn Phe Met Gly Phe
35 40 45

Leu Thr Asn Tyr Leu Thr Asn Arg Tyr Pro Asn Leu Val Leu Ile Leu
50 55 60

Glu Ala Lys Asp Gln Gln Lys His Phe Pro Pro Leu Cys Cys Cys Pro
65 70 75 80

Pro Trp Ala Met Gly Glu Val Leu Leu Phe Arg Cys Lys Leu Ser Val
85 90 95

Leu Gln Tyr Thr Val Val Arg Pro Phe Thr Thr Ile Val Ala Leu Ile
100 105 110

Cys Glu Leu Leu Gly Ile Tyr Asp Glu Gly Asn Phe Ser Phe Ser Asn
115 120 125

Ala Trp Thr Tyr Leu Val Ile Ile Asn Asn Met Ser Gln Leu Phe Ala
130 135 140

Met Tyr Cys Leu Leu Leu Phe Tyr Lys Val Leu Lys Glu Glu Leu Ser
145 150 155 160

Pro Ile Gln Pro Val Gly Lys Phe Leu Cys Val Lys Leu Val Val Phe
165 170 175



<210> 166 
<211> 28 
<212> PRT 

<213> Homo sapiens 
<400> 166 

Gln Asn Ser Gln Arg Thr Gly Leu Pro Ile Thr Ile Phe Ser Arg Ser
1 5 10 15

Phe Pro Leu Leu Thr Gly Ser Asp Leu Cys Glu Asn 
20 25 



<210> 167 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<400> 167 

Gln Phe Phe Leu Cys Arg Asp Cys Ser
1 5 



<210> 168 
<211> 38 
<212> PRT 
<213> Homo sapiens

<400> 168

Glu Arg Glu Ser Cys Ser Ile Ile Gln Ala Gly Val Gln Trp Cys Asn
1 5 10 15

Leu Ser Ser Leu Arg Pro Pro Pro Pro Gly Phe Lys Gln Phe Ser His
20 25 30

Leu Ser Leu Pro Ser Ser
35



<210> 169 
<211> 116 
<212> PRT 
<213> Homo sapiens 



<400> 169 
Leu Arg Glu Asn Leu Ala Leu Ser Ser Arg Leu Glu Cys Ser Gly Ala
1 5 10 15

Ile Ser Ala His Cys Asp Leu His Leu Leu Gly Ser Ser Asn Ser Pro
20 25 30

Thr Ser Ala Ser Gln Val Val Arg Thr Thr Gly Ala His His Gln Ala
35 40 45

Gln Pro Ile Phe Val Phe Leu Val Glu Thr Gly Phe His His Val Gly
50 55 60

Gln Ala His Leu Lys Gln Leu Thr Ser Arg Tyr Pro Pro His Leu Ala
65 70 75 80

Ser Gln Ser Ala Gly Ile Thr Gly Met Ser Tyr Arg Thr Gln Pro Lys
85 90 95

Leu Leu Trp Phe Tyr Leu Tyr Lys Gln Phe Lys Gln Tyr Arg Glu Val
100 105 110

Gly Ser Arg Lys
115



<210> 170 

<211> 25 

<212> PRT 

<213> Homo sapiens 



<400> 170 

Ser Ser Arg Leu Glu Cys Ser Gly Ala Ile Ser Ala His Cys Asp Leu
1 5 10 15

His Leu Leu Gly Ser Ser Asn Ser Pro 
20 25 
<210> 171
<211> 40 
<212> PRT 

<213> Homo sapiens 
<400> 171 

Gly Ala His His Gln Ala Gln Pro Ile Phe Val Phe Leu Val Glu Thr
1 5 10 15

Gly Phe His His Val Gly Gln Ala His Leu Lys Gln Leu Thr Ser Arg
20 25 30

Tyr Pro Pro His Leu Ala Ser Gln
35 40 



<210> 172 
<211> 25 
<212> PRT 

<213> Homo sapiens 

<400> 172 

Ile Thr Gly Met Ser Tyr Arg Thr Gln Pro Lys Leu Leu Trp Phe Tyr
1 5 10 15

Leu Tyr Lys Gln Phe Lys Gln Tyr Arg
20 25 



<210> 173 
<211> 25 
<212> PRT 

<213> Homo sapiens 

<400> 173 

Glu Asn Phe Pro Glu Thr Arg Glu Val Arg Ala Phe Ser Pro Arg Glu 
1 5 10 15

Asn Leu Glu Leu Cys Thr Cys Lys Ser 
20 25 



<210> 174 
<211> 11 
<212> PRT 

<213> Homo sapiens 

<400> 174 

Ala Leu Tyr Cys Ser Pro Ser Leu Gln Ile Asp
1 5 10



<210> 175 
<211> 37

<212> PRT 

<213> Homo sapiens 



<400> 175 

Cys His Cys Ser Met Leu Lys Ser His Gly Asp Val Gln Asn Val Leu
1 5 10 15

Thr Leu Phe Val Thr Val Leu Ser Asp Val Ser Tyr Leu Gln Gln Ile
20 25 30

Gln Lys Lys Leu Arg
35



<210> 176 
<211> 39 
<212> PRT 

<213> Homo sapiens 



<400> 176 

Cys Tyr Phe His Gln Lys Ala Gln Ser Asn Gly Pro Glu Lys Gln Glu
1 5 10 15

Lys Glu Gly Val Ile Gln Asn Phe Lys Arg Thr Leu Ser Lys Lys Glu
20 25 30

Lys Lys Glu Lys Lys Lys Lys
35
INDICATIONS RELATING TO A DEPOSITED MICROORGANISM

(PCT Rule 13bis)



A. The indications made below relate to the microorganism referred to in the description on page 54, line N/A



B. IDENTIFICATION OF DEPOSIT



Further deposits are identified on an additional sheet [ ]



Name of depositary institution American Type Culture Collection 



Address of depositary institution (including postal code and country) 

10801 University Boulevard 
Manassas, Virginia 20110-2209
United States of America 



Date of deposit 



February 12, 1998 



Accession Number 



209627 



C. ADDITIONAL INDICATIONS (leave blank if not applicable) This information is continued on an additional sheet [ ]



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States)



EUROPE 

In respect to those designations in which a European Patent is sought a sample of the deposited 
microorganism will be made available until the publication of the mention of the grant of the European
patent or until the date on which application has been refused or withdrawn or is deemed to be withdrawn, 
only by the issue of such a sample to an expert nominated by the person requesting the sample (Rule 28 (4) 
EPC). 



E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable)



The indications listed below will be submitted to the International Bureau later (specify the general nature of the indications e.g., "Accession
Number of Deposit")



For receiving Office use only 



[^1 This sheet was received with the international application 



Authorized officer: Paralegal Specialist

IAPD-PCT Operations
(703) 306-3745



For International Bureau use only



[ ] This sheet was received by the International Bureau on:



Authorized officer 



Form PCT/RO/134 (July 1992) 




PCT/US99/05721



CANADA 

The applicant requests that, until either a Canadian patent has been issued on the basis of an 
application or the application has been refused, or is abandoned and no longer subject to 
reinstatement, or is withdrawn, the Commissioner of Patents only authorizes the furnishing of 
a sample of the deposited biological material referred to in the application to an independent 
expert nominated by the Commissioner, the applicant must, by a written statement, inform the 
International Bureau accordingly before completion of technical preparations for publication 
of the international application. 

NORWAY 

The applicant hereby requests that, until the application has been laid open to public inspection (by
the Norwegian Patent Office), or has been finally decided upon by the Norwegian Patent
Office without having been laid open to public inspection, the furnishing of a sample shall only be
effected to an expert in the art. The request to this effect shall be filed by the applicant with 
the Norwegian Patent Office not later than at the time when the application is made available 
to the public under Sections 22 and 33(3) of the Norwegian Patents Act. If such a request has 
been filed by the applicant, any request made by a third party for the furnishing of a sample 
shall indicate the expert to be used. That expert may be any person entered on the list of 
recognized experts drawn up by the Norwegian Patent Office or any person approved by the 
applicant in the individual case. 

AUSTRALIA 

The applicant hereby gives notice that the furnishing of a sample of a microorganism shall 
only be effected prior to the grant of a patent, or prior to the lapsing, refusal or withdrawal of 
the application, to a person who is a skilled addressee without an interest in the invention 
(Regulation 3.25(3) of the Australian Patents Regulations). 

FINLAND 

The applicant hereby requests that, until the application has been laid open to public 
inspection (by the National Board of Patents and Registration), or has been finally decided
upon by the National Board of Patents and Registration without having been laid open to 
public inspection, the furnishing of a sample shall only be effected to an expert in the art. 

UNITED KINGDOM 

The applicant hereby requests that the furnishing of a sample of a microorganism shall only 
be made available to an expert. The request to this effect must be filed by the applicant with 
the International Bureau before the completion of the technical preparations for the 
international publication of the application. 
DENMARK

The applicant hereby requests that, until the application has been laid open to public
inspection (by the Danish Patent Office), or has been finally decided upon by the Danish
Patent Office without having been laid open to public inspection, the furnishing of a sample
shall only be effected to an expert in the art. The request to this effect shall be filed by the
applicant with the Danish Patent Office not later than at the time when the application is made
available to the public under Sections 22 and 33(3) of the Danish Patents Act. If such a
request has been filed by the applicant, any request made by a third party for the furnishing of
a sample shall indicate the expert to be used. That expert may be any person entered on a list
of recognized experts drawn up by the Danish Patent Office or any person approved by the applicant in
the individual case.

SWEDEN 

The applicant hereby requests that, until the application has been laid open to public 
inspection (by the Swedish Patent Office), or has been finally decided upon by the Swedish 
Patent Office without having been laid open to public inspection, the furnishing of a sample
shall only be effected to an expert in the art. The request to this effect shall be filed by the
applicant with the International Bureau before the expiration of 16 months from the priority
date (preferably on the Form PCT/RO/134 reproduced in annex Z of Volume I of the PCT
Applicant's Guide). If such a request has been filed by the applicant, any request made by a
third party for the furnishing of a sample shall indicate the expert to be used. That expert may
be any person entered on a list of recognized experts drawn up by the Swedish Patent Office
or any person approved by the applicant in the individual case.

NETHERLANDS 

The applicant hereby requests that until the date of a grant of a Netherlands patent or until the 
date on which the application is refused or withdrawn or lapsed, the microorganism shall be 
made available as provided in Rule 31F(1) of the Patent Rules only by the issue of a sample to
an expert. The request to this effect must be furnished by the applicant with the Netherlands 
Industrial Property Office before the date on which the application is made available to the 
t:tg«it«<?t.gci
^CQctccgjici 
agaCicracL'l.g 
got; tczKt goat 
tgateacatd 



cagofctggea 

Lctggbiacicc 

gtgg^catgt 

cagyttciiga 

gLcgggtgca 

t.tctagagaa 

aaggtgagiiL 

cctAbg?ig«g 

acittggtasag 

gtaggccagg 

ctgzitQcassfl 

ci^ggataaaca 

gaLf.gtgfcgc 

ctgctgccgt 

gtggttatgei 

ag^bctcagc 

tRcccccsag 

tggUgfe.gt.gg 

^ggoccctgg 

taortagCQgg 

gcracgggaas 

gcgtratgga 

qcagaagcert 

esL-ggttatc 

g««gct£acac 

SJCcggce^tg 

^99Cb.gcecc 

tgtaaagcgt 

gcctgatg<sc 

tagtgAggac 



gi^tcctcctg 
e>gcaactgc<: 
gggf:Al:.c;jatc 
agflccgaaKc 
g^k^tattggg 
ttacrt^libntc 
■bctocciggcc 
gcagnrcctgtf 
gcgLgggboD 
tacctctcaag 
ggcccLgK^^ 

t. tt-CCilBtCC 

tatggccaaL 
ggat!!<::0gskcc 
ggeiQctcctg 
cgcaccct.^^t. 
f:AAgn«ga£5a 
tgckccfaccgg 
ggtgggaggc 
ggtgggaatg 
cagggtgtgb 
gcrcLdi;;t«oo 
bgtcgasaca 
ctffogttt<i*<: 
atcrgt<-a«ct 
gcotttgctt 
cggccacKcg 
gUgggc^LCCa 
ct.ggagcctg 
aGLai7tgt.c7t 
i^ttrtcctagc: isi^tccctaca accac^ygr.f. <:a^trccat^gc ^lctCQcaf3Ct gLLT-oL*a?icc 186D

aLAfccrt-cctc ggtacgtgcL ijLgtjQtgtgt ggggMttgc? caccccaggL ygmr^toaco 1920 

5t.cotc«gcsc agj?ccLgtti:K gccc^citgtac tatctflcggjj ggtrS^ggscb s^^g^itacact LS^O 

gcttcagcat c£it.ggtbiict ccggacccca gcacccc^itis ccgcctc-rjcc tGc<raLgtc.» 204O 

tgea<rcthb.rj gtc^ccaccgci ctag«t.s<*gt «t.bgggoOTg qcaacgcaat i^ogcw^tc^ga 2i00 

f.ggtbaaggt scaaccca^bg ttcccj^jgtot^at atatgctcct gg^t^fcgggts tggtatgggt 2L60 

oatgoagatg ctccri;;^ggct tgcaggctcc acctgacragr. tgcatgt-tDt etc Lgcsg^jt 2220 

acatgtcfccL tgc&gtgt^it gaac:LLg»c:<; eigcccggcct tg^ccccctt gl.ggctgcag 2260 

cci_gtfigtga tcjgffgccgta wgntctttcc tttacangat tctgtrg<r:gigrM ttotgcagct 2^41) 

ccsttgctgsa accttcscac-c eitaagcratg tgtccteaag gtcoactcct Ctacacr&cg^ 24 00 

gigcacccranc: rsagaggcsaaB ggcU:<:fcc:c:K gtgccigcgca ntactgstgg <:;« gf:c tggc t 24 6 D 

Ktrstgggatc tcaccraccat gct-JiiSQCCEifci: aactccactg tIi^:hgg^^goc: tccscrcsgat 2520 

cctgggcttc rcrtrf^ccgoct tg^rcacec'cc: Lcrttttgnctc tccaCTSJCcea eagnihgtggb 2580 

fiL<:ti?icagcc tRcacaccLL g<:^?o5cocDt gagcGCcacc atctcg Lggr^ cagtcjacag^t 2<i40 

SQagatggali cccLcciatgt cttcsstgctt gctgtggag^s tgctacagcc a^iaag£i<jgc;b 2700 

j^t.^ggfigagg ctgggctGSjt acccn^&gEit.g c:gt.gtg<?tnG agraaatactc tg b r-cc etc t 2760 

9Cai>ptgcts cccatgtga<:: tiggoc;t-caag Gtcctaagcc craag<:ifltr-*>b ggtctcagcc 2820 

tccattoatc aacggctgnc cttctsacgt ctggggcal.g gtgaaccciae cttcaLgaot 2880 

agcactgligt. twzitgtacc tgatg Ui^grh gncjp t-ggar-t CTCtggcctcjL gwgcDctgac 29flO 

trJ:ggr2cacc cjttgtgcctiL t-gggggtcag Gcaai^ttgagg LLi_tic;^?flr.t.g gtatgactga 3000 

ggtotcctac gglrtfgi^t.ggc gtcctgj^aca tggggccLgc tcacagacacj catcfgagvpg "^060 

asfE^agg^ctg t.c^bgtgccca tgctcagfiaL isc:c:ttg?>ggg oaggaggtgg t:£jgc;r^gtggg 3120 

tliccL^(^tgb coCTtgcegga gotgaaggtg agtgaaijtgc t.g<rcajag^<*t atgcccaact ilHO 

ccccQfcgaca agaot^gaact ttgtaacaaa cagtacesafc ttattttOTC cgtgggtLfch 3240 

Curette ttLt. ccagttgatg acttLgt-gaa issttcccogg tatcgjjagcc Lctgfcggccit 33D0 

taaaLgtggc tcagtggagg g^gacccjagc atagccaggc cagLaLggwg cucctcacirc 3i6o 

a-pasctctca gaagc-tgcag gcuKacgaac atctgaecaa ^sgagutgtgg tcj^aggctcc 3420 

tgaaagagaA ^jisggcctgct ggtcl.^:at.^;i;; trtgcbtcct ttgcctttac ccn-wl-.^iijcbcs 3450 
Lctgcocgtc ccacccrrgtb ttgc?tgt<rt« ctcaccccca ggAtgt.stac ccugttsstag 

te^aa^ctga a^tQcatgct ffagcugtacfc Aggaa^^ttgc atatctagaa acagegaekg S&OO 
a@tcai:;:tggc ccatct-cttL gcrbcttgtgc cccaggccag aatiaaagaaL ug^gigtara 
gt:rciaaaaaa aaaa^^aaana ^laaaaactcjgf ag 



<210> 16
<211> 1428
<212> DNA

<400> 16

agoacagttt. 

ca-iTtgLgtM 

t-ggtggoaaa 

t-ggogngaaa 

aactaagatc 

tgtt^e^gaca 

^gabgcattia 

tcsjcctcatg 

ggatgtagcg 

gttaataacfc 

Ctttcitt&gt 

aaggcagcat 

cbijtttcc&c 

taggt^dtgt 



g^gcctcctq 
ganaacagt::^ 
tzciDi^^aggaa 
tgnsttttct 
ctaaacctgL 
gtatttcbtt 
tQctcgtacc 
cctttcccrtc: 
ggaoattcct 
tggtcttgaa 
SGgatai::«eb« 
tactgcttta 
aaccttagqa 
agaaatAttC 
atQpdgoaagt 



^agacattga 
htboctaattc 
QCttatggcc 
aaggtgLggM 
Agt-ttcaga^^ 
ttggt£5gtgg 
gtgacrtccca 

gtaacract-tt 

ggtsceitgac 
actttagaad 
ntgaaggt.<:tb 
nccctctgsc 
ctgcacgagc 



Atttgwggat 
cagaggacsaa 
gC tcfiLcra^a 
g^tttbgacb 
tgcaaaagaa 
tatctttgce 

tagactgctg 
atccgtdOcit 
ccktgtctacc 
a^^cattaaccr 
gaobbeicstat 
ttcattttct 
ttaagcrt.ggi 
ttc7tbgtceig 



tacactcagc 
ggcaaafic.-ng 

tgaccnttta 
gaagcl.Kgbg 
Rtggccctgt 
atactgacct 
gt-tSiccttgg 
c-tactggaaa 
ratgaaejitb 
bggsg*>.^?5t a 
bt:att.aatt.t 
cccatcaagc 
libagc&gCaac 
owtttcttg 



craaLgaz&gc^g 
apctcaaatt 
tactgatggi- 
gLrrt:tw9gafla 
tgctacctta 
£atcttattt<; 
<: L: riuflc^cg b t 
ctgisgagsga 
t:cgt.l:apvab 
tttcnttttc 
Ciaatatgat^ 
aaac^igiactg 
tatgttagtLt 
taaciiag^gc 
crtcbtctgtc 



60 
120 
IBO 
24 D 
3Dn 
3GD 
d20 

5d0 

<iO0 
5&0 
720 
780 
B40 
90O 



gc&LLacith:?! ctaaftct^Kcr. caactctcat cat&LLctLc ^ttta^iccac ctcct^cipts 9^0 

ttttcttttg gacti<jbggcc taaaattte^ ttgtttatst tttacttgcQ LLijgcstttca ID^o 

oatattattl. 5*>t.rfcttatt Ll.l-.gtttdgt qtcttcciigt 1 1<: l:^5,t 1 1 1 te^ctctgtca IDBO 

cggctop^^tc tcctacaiigb «g<?ttatgtc ccctl-L^ac:« tcccccc^atc a^icctci^^cc^? 1140 

tcoccctcct gfcct:r;tgcct caccctct.gt: t.gbt.ccc^inc gecccccagt r^bccD^gca l2llo 

gcaaaagg^a Qccctctcct: nwc^gcttcoa cga^ittcatj^ ac^citni-ii^i^st? ccaaaatoc*j 12 GO 
ccgchbcnsg affccta^wgc gtc^gtaag ttcUgt^i.^ing gaatoctot^r tcLctijscgt 

gcbttOCTttg ffftUjt.ttgct tcHgcAt^i^r. t^attttgtt tgtQaar.g&ij tccattRtgL I'Siio 
tttcccaCurt catataaaaa iJgt.t.Maaaa aaaaaaaeaa aai^tcgeis 



<23J > 1465 
*21i> i]unir> 



<220>
<221> SITE
<222> nj

<223> n equals a,t,g, or c
<220>

<221> SITE
<222>

<223> n equals a,t,g, or c
<220>

<221> SITE
<222> (549)

<223> n equals a,t,g, or c

<220>
<221> SITE

<222> (IdOf^l

<223> n equals a,t,g, or c

<221> SITE

<222> (1477)

<223> n equals a,t,g, or c
<220>

<221> SITE

<222> Ci48BJ

<223> n equals a,t,g, or c

<400> 17



ffSfi^ggonagg fttgatgatg^i aQcjoc^cgtac acaccattcg ac«ccccctc asiGtaaacL^ 
gaaacagtga aatgggcgtt cacctgcrccg ctgagi:tU:g tcttatactt cacrtgtaccc 
aact^caaca agccgcgctg ggagaa^hgg tbcatg^jtca cgtttgcttc ctccaogctg 
t^gatcgcag Qctcctetli?> catgatggtg tggatggtca caobcattgs ctacaccctg 
gggattcfctg acj^itdrptcat ^jgggatcac cr iit<totggof.g Qtgggacirag cgtgcctgac 
cgcatggcca gcctcattgt ggccagacAo rggatcgggg acatngctgt gdcaaactcc 
atfcgggagca ac^gtgtttga iiatcc^^abt ly^cctcggtc tcccctgggc tctgcagai^i? 
cbggctgtgg afctacggabo ctacofcccag ctgaats^^c^a ^ffgggctffat ctai;tcogta 
ggcH-LgrJK-.c ta?iccto^;gt. tbttgtcocfs crtgttcs^gcrg iTcn^^cotRac^ caaqt^ijciMg 540

I7.tg^3CiC^laIl^a agcl:gsgcts3 taaJTtgccitt <n.c:c:Kcjt<iCg gt t.g 1 1 cc L gl-gcttctcc 600 

Efctcatciactg agbccstacgt gtLcutiobtt. gcgnotcctac ccai-ghg^gg ggnccactgcs 6faO 

acc^i;<:gg5b ccccacag^r gctc?aRctM ttcttttcLg tgc^nstocf^a i'acccgijt;ng 720 

ccicccciartc aciat^flggccc ct.^g<^ifiz<zar. sgcgt-tcgto tctccLgtgc: bgtcctcai^g 

cct<:rc'gc:i.t:c tgttttgsgtg gt:ccarcct-!r tcccctgccc ft/atcct-cgct cccccsci^t.^; 94 0 

"^t-bgggtcat. acccar.ccoc cctttcctgc L^Li;4:i.<togt.g tkaaefacatc c^iwr-^t-ocac 900 

gtgocttttc ^ngctccatc ttt.g£i«ti?a.gb BrMtgagatt cHagaiA^ci^i:^ ccggctgcrta 

acl:ggi::tit*j?% gccaggcaa^: pn^tgcittcco atccctyytit: r-t-bbtttaag ttat1J.gr?t.g 1020 

<jnpgeictcac ccaaLht-gtg acctgagacL g lihg^ o ga as tacaagaggag ggggcccgtt lOSO 

gattacaciac# ay np^t 1 1 eg attLligbtt-g gtttggtSCTati gatgi:i:t-«gg ttoctcsggtt IL^O 

tgggggijrtbt gCtttcttlJ- gggggccttc cccttttaci. ri;;st.tt.tctt.c cagagaL<:«ti I20D 

gflgcttcitct t(Tcat.^:btr?t tccacr.gggc inrvc^gatt-flp tcix^Klitaccc aaAgtjr^t.gCfit 1260 

cctsccgtgt t-gtctgOTCt tgcatif:^:<:Mg ;atgtgttoaa gtatgcal.gg eitgtQrTtgct 

ttttagaggrt gcc-actgggc A«ggccQcca a^iaacaaatg ci? t-gacatt t tatagctiirtg 13B0 

g«r,-scctMC taaagt^sbtw. tgggcfgtifii: c cuggggbt-gg gOTBfJcacaa gyiibbt-ggag 1^40 

5oaffaa?raca acttwtCQt tccalCiitf-.H ccatctnttc cLcrai^taTig 1^69 



<210> 18
-c2il> i^^O 



<400> 18

acgcgtccgr. tbcccagaaa ^Ljag^i^tgaca tca^tgcc-cc Ct.Qi^bocsttb ctcagtccLc 60 

acrf-fthtgctt tgagggcMn ggtactgaaa cLggr-Cybct bgactcttgt gtcagrttbt- 120 

tctccaat.c<r aLbotccccc tec crl-t.gc b t. cbgacgcagt ctaggttafi« rst7%gccac«3c 160 

aggtagttgt ggoactggtg <ibbbtceaaa flccccacttt agcigrtboagg ccacagctLL 240 

tt£it.»bc5ca cagg^Aacot cffigcct^agc Lgtit-.g^ic^tcfl t^cctgtttc ccrcaggMisc-c 3 00 

tccictccttt cygtagaacct tgggat.t.t.r,« g<la^^ttgta?^ ctttccataa cti^wbttact 3 60 

ccaacagth.g aaottacaca c:?ittgc?tccc aaacttggaa ab^flg^Hcoaca gtaccttacc 420 

tr.LORtbccc catctggcct ttaccttctL i.ectt^;Agtg pttaaaaaca gttgac.^tvit 480 

tc«aa$itate gt-agQt:tt.ca acrcLcr^c^nca aatgacaaj^t cccrat-tttwu oatcctajjga 5d0 

aggccrtizAcr^A atttcattin; acjgcgccaj^g gcggctgcrag ttggassgGcg agggtcegctir: 60<i 

Ctf.gctcact gaatgtcttg cati^tgct^is r>t.gctgc<7C!g cagcgctgaa caLgcicccac 660 

cgcccaggcc c*agcocCgct. tgttgggtca gcatctagtj^ ctg<2rfc5tcnri fst^crtt.tgt ct. 720 

gcacagcc:<?g baggatLgf.f: boagccaggj^ ggntcatcag ftaggtgtgca aggccttUgg 780 

ggjcaDtgag cccctotogt gggcagL<:tn ct-ttoccttc ccaccteccL i^^^aagcaca fidO 

gaaeacagtij ccttggtttg tgbtttgaag caaacaagtc Agcstbtct-gg ctttgcccca 90O 

aaactgtgot ggaacAtaat D.aaactggag ^ib.^tggttbt fcaacactgca aaaaggaoaa 560 

Agcatcaagc tbcbacttct ggcLggwaog caaaaccaat cCcag^^tg^c zsjiggctcggc 1020 

aaactaagbh ttcctgagcc tsiattttcctc tgagccctga c-<?tagoctsg Dcttacctoa D.OBO 

ttaaggbttg pttaa^jcQg tguaaaggag g<iggRggcag oogtggatgg gggirgtgggg 1140 

cigggnatgag c^^ctotgcag ccgattRatc tjjttggcagg ggect^igcitt ottsjggagtg ISOO 

cttattcagc ccaagagt&g aggctgttta cagcgagci^c tggaEatgcfc agctLgbc^tc 12 60 

cragr.tgggsB ggggbcaggc ccctaaaliV.g aagoccoctt. tggtagcaga <?c?bgtagRaa 1320 

cstBgCijagt-iit oactcacaga tfcptgceigca gctgctccac acrtcaofctaaa gcaascgccg 1380 

acaggc;tQga ccccag«ccg cagRFggctgc rQc^tacaagg t^ggaccaca ggc:i;gccitca l^dO 

cc^gggattgt Lbgcicactaa atagcbggag tcacagattg agafc^Mbgc coccttcaag 1500 

gttLgcagtgji. aatagcataat- cctatst^jat gaatLtatat gtgbtattct ttaaafi^agfjb 1560 

attLbflttac tgcatsttcci cgtcccgtcrl. hgcgoatgcg actccccgccr iic:c:^i7gcgaj^ 1620 

gbgcagtcgt tgc^gcggct ggtgcaggag tgccactggc gcgtgtstgei tagcatctcg 168 0 

taggtgtbgi:^ tgcacaagAg tt^nccagasi tcaatgccaa at;«fcciitast.a tgagfi^gcgt 17 dO 

actttfctaog aaatt^attt attcgajatct ««»;5itoCttt:t gaaatataaa etttbtggtcjit IfiftO 
ffittttttaea cj^;tota£ittc ttcrL^gacet trtgtsotta aaaatttgai. i.gtgcttstt
aaaa£5L<jgt«; atctatgLi.i: r-.gi-?icttcaa ctscatj^aaa au^awaf.ttc tttMc^aaM 



1310 



<210> 19

<2il> 1B92 

<212> DNA

<213> Homo sapiens

<400> 19
tatttgagcc 

^ t<:^c:t.celt.!3c 
socage ttgL* 
gGaa.g9«9t;gg 

asaagactt.o 
aggatggcjgt 
gaaetccagcg 
ccggtgagtei 

actggDcace 

caccdaace^g 
cbgaga^acg 

gbat.aAgacc 
tgvta^gags 
cacteftLt^tn 

gtactcccag 
tgcatgr?agc:t 
ctt^aaQaaa 



gagcrattbfctn 
a bgrta t^jt g 
ctgcagtt^ I. 

ttagbgE?tcanf 
ctgagagcLg 
catcot^-t-ga 

asjtgggll.cc; 
aa^g<.:caa[3tg 
cc^cgcatgag 
gcatgaagt.c 
c^^^gccoaca 
gggsiaggtct! 
tccrt^t.<7br.o 

aagtcaticLrr 
gtgtgagtga 
ag bgsuicag t 
aac^atacagt. 
aag^gr.agco 
ggtatccacc 
agcctttacg 
acatitvcbtct 
c«ttttggga 
atgatc^Cgn 
aa^^aaaaoaa 



K<;t.gt.«igtgt 
tottgtcccc 
tgaggtgpbg 
gcfl.gtsccca 
stiaaacatcc 
ccagt-gcbgt. 
atgbgccgai? 
aeagccagcrc 
gi^l:c:bt.ggcn 
<?aac3ttccLc 
ctgtccriigtg 
ci^gbttccct 
coaaggttgg 
tgactcgacb 
t.cfc ttcctgc 
gcatctcagg 
aaLgatagK^<a 
n ftca atngtc 
gcagaaacaa 
aacLtclib^t 
gt-tQdtttaoa 
tgaaaataaa 
ggcLgaggc^ 
cactgcactc 
aaaaaaaii^^ 



cc^glt^abuc: 
gbtotttaat 
ttgctctgag 
attttgf.'cgti 
gtggggbgga 
cccaacag^t 
atcacgn^ligg 
ggcsacgtscaa 
gataagggtig 
cg^gggrpgg 
ccCggggctc 
caggtcggag 
teicgbgccsc 
cogsTRctgac 
gcagggLgng 
ctccgggacsD 
agcccagQct 

gr?bgciagbg<3 
atactaatag 
cagcaagagi. 
atgttbatoc 
ccctaatccL 
aacacag^uc 
ggagga&tac 
cagcctgg^tt 



ggt.Qccttga 
ataaaaaLgg 

tgaarracctg 
gcjggcciiagg 

tcctcgijtac 
catggggn^g 
ciot.ggauctg 
ffRsccCggaa 

accuse -rjittc 
gcsa«tacatt 

ggatgcrbgcc 
ct.t.gsQQtjra 
cagcgccagc: 

tggfti;;wbC3. 

gggcacaagc 
caaCgtaatg 

t.gtek2) aaaga 
aaagctgact 

acstgcajstg 
ttgagcci:::Ag 
Aatctcagsfja 



ca auciti^gekC 
^^oatttaUca 
Mctgagb^b 
agaggbgggt 
nbcacccgag 
catttctigtt 
ctiflcgocaat 
cccagsggt-g 

gcag9ggfl.gt 

1 -^rcagcatc 
^gtctgccgr: 

C:flcccitcaag 
^caacatLgi;^ 
aagggagc^bg 

gcctaagctir 
aaaagcagbg 
gL^^gba tcag 
aattfscgtga 
i^aagcagtcrt 
atgflRcccao 
ctattgtgtg 
gctcatactt 
gngtt-cgaac 
gaccctgfcicit. 



60 
17.0 
IBO 
240 
300 
3&0 
d2n 
^^0 
EdD 

720 

vho 

90O 

^£0 
1{)20 
ICIBO 
1140 

1260 
1320 
33^40 
14^0 

1560 
ILETX 



<210> 20 

<211> 1410 

<212> DNA

<213> Homo sapiens

<400> 20

gcccacgcgL 

tactttcttt 

gga^tcoctt 

dtcdactttf. 

tttcttgtbt 

ttaggtcagg 

<^tgaat.t.gaa 

atpaate^tta 



CDgagaaaaa 
ccatgtgocc 
aat^agagata 
bbact.gt.cacr 
acCac&i^cbc 
tactatttct 
bbctcctcag 
aataacttca 
tira^gtgttt 



tgr:bgr=bc«g 
eictgtgtgta 
cttgtaaaaa 
Lir-gcesgtttt 
caatttgtag 
gccatttctc 

Cciktacabcat 
gtaaactaca 



tttttattgt 
cccgttgcAc 
ggciDtbttgt 
tagattctct 
tgaaaagaab 
agtctcgtan 
gagat.agcaa 

gagbgoaa^i- 
oagagtsitct 



cla<:cP5it.gg 
atcttcctgta 
ttttctaLa?; 
gata^sgatbct 
gggctgctat 
fccttgggttig 
tttbbbattQ 
gr^t.cctabaaa 
caaactgcrag 



tf^agtataca 
gcceaggAgei 
vgckatgbasa 
ctgatagcaa 
accafcbggi^t 
gtto ctaaca 
tagtattatt. 
tgttaattta 
dt^acaaaata 



GO 
120 
1 RO 
240 
30D 
360 

b40 
ctcaaaggat ggtctccatL ttcA^gataco
caliL^asc:^^^^ ttaccLhc^tt cccaecaatc; 
t-gtbcnctta ccAugsgaac atttt LUgi:ii 
tEjaa^itj^cat Lggt-Dtaatc ae:6gi;;t-CQDt 
aagt.agi:rl:yg gciccacag<i t gagtgccacc 
gtagj:?gQctg sifftaatatgg ggtggaac^ia 
ggtscaafffia t1.g<;tt.oggc cagg^ngpgbo 
cact:ccac^(^<: tgggc^iaca<^ Mntgogacct 
aaciiLisbpr.g 1 1 tt 1 1 1-t bg tttttttttt 
esonocgactg ggAJ^igatgca?! gcttcsggt-iifl 
gccaccacrcrt: botctagga^ gflgttcct£?c 
tcLgijgtbat g^ragaagt-bg aaaattgttt. 
bccgatttta aLgti«««ct.t ttcaijttabbc: 
ttt:acl:t.l.<^r:i ctcctaaaac: tgccttttcc 
ligfl^ATinaaa aaaaMuj^taas aaaaaa^aaa 



ctatagsisga gc^stii-.bbctt acttgatcfit eOl? 

cacatggdii.g 5««ggao£=itt. cetcU-.r;t«c. 660 

gbtttbtgcig acaatgtcLg Lc:gc!ccac£JC 

gcc^ucctcga ccl-irtcibrtcci tcaRtcCccL 760 

scacccggc'L M^bbttpttoa saacraLLbtt. B40 

9^:t:i.gt.«gt.c <?caKatact.<: aggwggctifa 90U 

w«ggctg^af$ tgagctrgbgca sioggccaotg 960 

tdtcticaaaa a^Maaaaaa aj^tttcLbgg 1020 

ga&£s£iLfc:c:flg occttqtgcc cLLgf-brtgn lOSO 

cagccaaacc tggctgl.irug ctgtgtggga 3 "1 40 

Ctctcrtatgg tf:rt(agrQbaaK tcaagct.cag 1200 

L9LL<:<:t.r.4b t«gCtt.ataa tityhrttgaaa l2<>0 

acgtt.tgtot ageLal-l:br,-3 gflgixacrcat.t 3 320 

t.a£ggttr.l:g bcaataaaac ectal.gobgt 1380 



<210> 21 
'^211> n2V 
<212> DNA

<213> Homo sapiens

<220>
<221> SITE
<222> (979)

<223> n equals a,t,g, or c
<220>
<221> SITE
<222> (1047)

<223> n equals a,t,g, or c
<220>
<221> SITE
<222> ai3S)

<223> n equals a,t,g, or c
<400> 21

Ct^acgogtc^' ggccaCggtL gccactgtct gtjsgcctcrct ggtcbtcctg agcctgggtt; 60 

tgiytaccccc agtccgctgc ctgtttgcec l-Cft9<^&bgciD caccccgggt aligg^igcaoc 1^0 

gtrceccggct. gc^bcctgt.cc tacreLgcftcbg ccaccctgac cat-bectgtg gtgcccaacg 180 

tcctggMHs cgtgggugcsg gccfggsacagg cgctgaggLg bgtcaccgag ggictKirictgg 240 

agagtcboot caatcccact caccagctcjtt fltgcogcotc cagggcterlLg ggccccacag 3fl0 

gci^piggcagg cagcccgggc ^^tgcscRbctc aggcccaoga cfi^^tggctct gccttctacc 3tD 

bbcacal:g<rb caoggt.M<?b caccasjgtcc Lggaggal.i-.b ctct^i^cctg gagU^^wtgg 420 

cGcgggcagc agcg^taggg acccagcgag bggtcocagg i^fcigtut^tg btgggcctcc ^SO 

tggtggagtc ggcacggtacr cfcr:catbgct acctsjacaga crcbgcggttt gacaataL<;t. 540 

i^cgccactca acagccg^cc cngcggttgg cacaggcr.isrt gcctacacac cLr-obggccc 600 

ctGca<roi7QC ctggctgctc caggcggcL^; ^gcbgaggct gtcecage^^g gagctgttiga 660 

gttgbcttct ap.ggctgggg ctgctbgccc tgctoctcgt ^gp^coDggct gtggcggtgg 720 

ccoLcagacrcM Cat.agc<it.tc cbcctggcac aggcLactgb ggactgggct c^gp^gctgc 7Hg 

caactgbgcc catc^cgctc acggtcaagK atgcttgbggc acacactgtc ctiggsicttca B40 

tcdjbtccct CLtcaaccag crtggetccgc agagcccctt c<^tl^b<?c?otc cacagcbc^ct. 900 

^jccaatgggei gccccgcccc accbccgccc gt:iig<:tfrcact gotacccgcc ftggcgtcccc ?60 

gcgcagcbgc cccgetggnc gcggggggcc tgcagctcct. ggcgggctcc acggtgctctf: 1020 
tg^asfecttst



gal.c:«:gi7cac 
gciccctaggg 

ttgijwgt.tgc 

gciit-grttagg 
gogagctcat 
ctctactti^l: 

t-L£it.Rt.gCiDt 

taaeicgtatt 



f:i;gDssneitg 
ctacacgccc 

agauacctga 

gt.gacDtoicC 
cctaqataac 

ggt Li:i^ln;c:ia 

t;ago tgga$;a 
caaaa t cct c 
Hal. 1 r;:t. 



ccatCQcc^c 

gcgtccccac 
ccttaagaac 

»^t-5«gc!cC^D 
tcccaci^cag 

grgctggttOT 
ccttgcatea 



acccag&ci-ri. 

Cijsagflgcagt. 

Cggtcctgtss 
^rtttctacat 
ggAbgtggtc 
ccatCTcaaag 
cgatcaggcl. 
tgpatttctc 
tftta^^taacc 
aaa aa£s« 



aggcnocaeig 
y^mtgcaacc 
G^ilffi^agaag 
ccgcrcLctfM-t 

cig?Tat(^gaag 
caL^i^tttta 



1140 
120{) 

L330 
1530 

1500 

1620 
l&BO 
172-y 



<210> 22
<211> 1218

<213> Homo sapiens

<220>

<221> SITE
<222> (389)
<223> n equals a,t,g, or c

<220>

<221> SITE
<222> (740)

<223> n equals a,t,g, or c
<220>

<221> SITE
<222> (1048)

<223> n equals a,t,g, or c

<400> 22

tctaaaargg 
attceggggo 

agctcctgtc 
cttactttac 
attcgctgtc 
t-gg«i&99gga 
aacacatooa 
tli^^tbfcctca 

aatcgattgt 
<;ogttcaGgg 
ctgctgcgcg 
agaoacoaca 
tgga^gcagg 
acttgttoag 
Qtgtgtgcjft 
tggtagtaat 
tctatCMaa 



taaatgccicn 
i;7e4ct>ggatga 
t ttLcaaag^rji 
cractgtggtg 
tggtgggtgt 
actgagtiij^t 
ccrcttccttt 
gctceigcact 
g-ccattt&gb 
tgA^gccctt 
ctctRtcatg 
cgtaatgunc 
3(7gagt£»acn 
tgagccagcit. 

acttgtaccc 
ttcagtttge 
gttttagatt 
CHcagctctc 
aat.gai:^e^gg 



iicca'taagac 
caaaLggQ^^A 

ctgcctgggt: 
gtgtgggi755 

gctccctctc 
gtcgcgcant 

gggtgacatc 
tatiaccrtatL 
gciiccttttg 
aajTititmccctc 
gaggligf^5C^» 
cgctcctgag 
gt:ct.t.ggat:a 

ctctctctaa 
tgcctgtnac 
(rt:titt.aa9ta 
aootasocag 



taaa^ttt.^yt 
gcsttgatgct 
sggtggcctcr 
C^^o^gaccta 
tgctgttcct 
caccacggac 
Lhcte^gctbt 
gbtggtgaac 
attcactgtt 
an»bac:tbcQ 
ctcatgtgga 
tgasgtgg<M7 
gtsgcggggc 
tttccctttg 

Qcaaagacaa 
aflgtl:atca<i 
tt«ic5«ccog 
aacttatttt 



tgtocttttc 
gerctCaaLg 

ggcaggaccc 
j^tcrcacgiiLd 
"^gcagbcccc 
aoatgagjjag 
tcagggccrl.g 
♦■-tb^iaabgct. 
tagtatLgaw 
elihbflwggt.g 
acggcgggta 
agct.ggr:oi:^M 
bttgetcagca 
(jttggagLtic 
attCAtbctag 
t«igat.gaccc 
gaatactggc 
cagceant&ag 
cbAtQt.ttaC 



rccttcLgeig 

t-gccigctagg 
t£3tctggtgt 
tgggty^bgg 

ttcc-crtagtg 

agCLLa^gflfl 

c-tckcgtctigg 
gctgcagctcr 
tAftcbtngcb 
cagaaacacg 
tgattcgtijc 
ggtcgtgcag 
ttttQt^ttao 
Lageiggtggc 
c£*ttt.ggaac 
tctcratettt 
ttb-ctiaittta 
ac^taatitagg 



CO 
120 
L80 
240 

aoo 

360 
420 
d6D 
540 
600 
6^0 
720 
V&O 
640 
900 
960 
1020 
lOUO 
1L40 
1200 
1218

<210> 23

<211> 712

<212> DNA

<213> Homo sapiens

<223> n equals a,t,g, or c

<221> SITE
<222> (28)

<223> n equals a,t,g, or c

<220>

<221> SITE
<222> ^T7I

<223> n equals a,t,g, or c

<220>

<222>

<223> n equals a,t,g, or c
<220>

<221> SITE
<222> (124)

<223> n equals a,t,g, or c
<220>

<221> SITE
<222> (696)

<223> n equals a,t,g, or c
<400> 23

t«i3EGCcgqg ocuCTttflcaa tttacncnijQ aaccgcLttg r^t^co benjECt ttgcaaae^jg 60 
ctcttit^ggc j^ccatt-uba^j ffiaagtaccee tgciaggt^cic ggtccggaat tccfCiggTitLsg 120 
aecfnacgcgt Mgoggaggt cytttA^aa ga.ct.^?tc^aaa ggci&&&tc-Cr. bgatccDccg LGO 
cscccaccclib acccctgcM t-cbcaccaga gcaaaattcra cfcgssoactt. t:tcccacc!^^c 240 
acatgeaaat ctcitccactc cgaatacctn^^ bgttttccat ttcaasttgt ^s^^gggagsjg ^00 
g&tggaacac ttcca^rt^at ggL^agagat ct^rttatiyaa ec^tn^ncaoo cc<?c(?iigt te 3 60 

ataacttMt ctfjaaattrtB tttttatgag ccgggcct^ct? tgtgcctcta gtatncbtst ^2(1 
attgactctc atagt.teiccc ttttagttbt act^jtattct gt.gaaaAt.bt gboattggtit 4 BO 

gngaatceet gtsrugcgtcrf: AttcbCattc aactaaatf:!. ccocaggttt tttgagotgg 54 0 

tgtggatta^ tctaartt^tt gcattcaacc at-tflgt-gr^tQ ccaccttetc ae^ttacaat 6O0 
acrae^ttactg gcaagcaagta ctgcisbfctcc tatCTcaacaa aaafiiggwRoa ataaaaaatl- 



660 



gctaatgcba aoaaaaa^M daaaaoaaaa aaaaanaaaa nassggcgsicc gc ?i:i 



<210> 24
<211> 1422
<212> DNA

<400> 24
t-aii^catcLct 

OTttcattLt 

t^tgpttacgt 
ctffotgaatt 

cccasaacct 
accactLLAc; 

Sttgacctta 
actgtn tacit 

gatLLt.scct 
L-t-tcctcata 
Qtagtctt.<j<^ 
ag1:tg^gc«t£) 

ggAQAtctga 
ataca^zaf^lig 



ctgtgco-cgg 

LLttgabcaa 

^etaattlitgc: 

tsgaahc-ctt 

siBtggc:t.atc 

sctgaaaccrg 

Kca Lgcfaani^ 

ac^g^oc^ga 

c-atcccagga 

aacacati^ttc 

tttrtiacgasa 

b ^tiact^^ctt 

cCgcagLagg 

tLttgcb^iat 

aQeiaaj^agc't. 

tcta^i^gcjr^t 

ggo^itttg-tt 

tj^cagacrggn 

tttctgtaca 
aatgaahat.b 
^^f^tgaacaa 
«lai3ltL^:tgac 
aaaaada q 



gaagatgqtg 
a caag^i t±An n 

etta^aaacag 

cgtl-lignEicn. 

ckgcpratttgc 
tigaaliLl.Hl-.M 

cigtocaatgc 

aaataaac^l. 

cl-Kc:<?t-gggn 

gaageti^LCe 

tactg^aggy 

gcuatcsatgsj 

CAaaa<7tcct 

tggl.l.rtygflfl. 

rtbgac€iaf5ca 

e^aagga Lag I: 

1-Lgr^totatc 

t^ggatccatg 

t:@aaaabc:hK 

l-t.bctg«eitt 

aaaaaaaaaa 



ctaggtgLrhh 
gctttttggsct 

gc tgt-gcttc 
ascttfiatga 
aa&gacegl.M 

goabgtsQOir 
^jCttcTtctca 
catigccttdg 
I'.gf.nbancigb 
tgtttt^i^t^r 
acgctLggfig 
ctgatgccaa 
cccacggaeg 
cLLtgishHt-c 
ggcttctttc? 
aaaagtgLt.g 
ggf.gpgtt.ta 
ttcagtcctt 

ttCCLCtt.gt 

r^^gcag^at 
taaatataga 
a aaaaa^t.^g 



goccg^iatca 
aatj^caaaga 
crcl-ii^: tgp^»g 

agttL-Laattg 
agaanbgtrt 
tgatcatgat 
ac<rcagtgga 

agl:t.r:ggcigg 

»tggjiQt Cija 
atacc^gaca 

gscbtEPsgat 

agacctfjatg 
gacLLLijgw;? 
bggggaatca 
caagctgtgg 

gaccctgafft 
aqaaatiggcrc 
aLa^^ttttbtn 

as 



cgccaLcmih 
gg^r;tgagail 
ncatgaaacc 
t cca ac:aM 19 9 
gcitbggaaag 
tttgcccCTtt 
catgatcLtiH 
Lt:i±hgt.tgga 
tcac^gcctac 
gt:gl:gcc:agh 
cl.Mc:ttcata 
ctgtttgggs 

ttgguggttt 
iL^rtflctggtffc 
ctcc^tcrttf: 

tttcaaccca 
aatLtgLhcri:^ 
*itgggtttcc 
ggQcctaaca 

gtiggflt.tgtK* 
ccaacttgga 



&0 
120 
IPO 
240 
3D0 

:i6n 

420 

54 0 
COO 

7ao 

900 
960 

lOBO 

ildo 

1260 
132D 
134iO 
1422 



<210> 25

<2L1> L03ft 

<213> Homo sapiens
<220>

<221> SITE

<222> (&D£> 

<223> n equals a,t,g,



ggcfacga^jtg 
5<)bDtgggaa 
Daggcctogg 
gctacgtg^ra 
c^gcbctactg 
tcggatcccc 
tgcgctcgca 
occccgcggo 
ccagcttctfc 
aggaggccag 
gcacagccaa 
cggtgctsgt 
agate baccg 
thgctgaagg 



gCt^i^agicrgQ 

cTccsffssgagcc 
caatatLgi^t- 
cceootsgat 
gaatig^gatt 
ctaccrt egcc 
eUgcctcacg 
ctacagcctg 

tgceebgggg 

agtgacogrg 

ctacccggert: 
ggccctc«cc 
gcagaoagcc 
«^«?t5l5Gcagc: 



or c 



ggccc gcgtg 
gaggbd^cgti 
bfct-agacagjy 
cccagcttLg 
gcaiiiigatggg 
tgctERCtctc 
caggccaf.gc: 
Sgci^tcgcgc 
t.t:cact.ggaa 

tigggci-^tcei 

C:oicflt.arcgg 
tccgggtctc 
t'tcctnigct-g 



gc.gcctcictg 
acagcccjjgt 
cagacttcLg 
iiggcstgocgt 
Okacacaagac 
taagerLccjc 
tg«gccagcc 
tcctgggact 
crLmrrr^t^gg 
ticat^Qctgga 
tgcacgccag 
ctctcctaLA 
dcokagaggag 
ccccaagtgg 



aggcggcccc 
ggcagggc^i-t 
i^gbbatgacc 
caLcaccatc- 
ccgca^gcbg 
catcctgctc 
caggalic^gag 
gffgcgtcgtg 
c gat. tact tc 
caaccci^ntg 

cgaagag^ccc 
ctfTattgag^: 
caggl:^<:^1^tgc 



cggal.geiQga 

gactgctgca 
cggctgctgg 
acttt.Dciotc 
ogcaggj^cct 

^gcctggnca 
ctegtgctLcL 

gg9at.c-./;tc« 
tuctDoggaa 

ttCac:r:go'tg 

tgcaacagct 
gcagggcga$ 



60 

I2n 
ISO 
2dD 

360 

480 

600 
G60 
720 
7H0 
B40 
awtOTtocct gcrtgc CMg^r QCt^ccc^ay 5<-9*"-5y9f?t.9 occcafrtcjcc LLggMcctc
ct^ccttgijcf gd^c^cctggfsc gL^ccrsj^cflt- otgijccattB agcLccinttCf;: ceicacecttt 
caLtiitnccan taaaggcmic: r-^gaccccaa aaaaaasaa^A n«viQa«iaaoa aaaaAMnQa.ii 



900 
1020 



<212> DNA

<213> Homo sapiens



<400>

ctQctttgtf 

t.tcia«ictttg 
acttctttcti 

ttaaataaag 
gcaaegcittc 
o^g^ct-OTaat 
ctcagggcct 
Bgaa«LcciDt£3 
cc^acgaaag 
actcteccRo 

aj^ctgct-cpt 

attttattat 
caggtg^waa 
tccstcacaot 
bc:tatcattt 

etccnoctat 
te^ttatcciiit. 
gee tact at t 
gcka&tcaagt 

tctcagaaaa 
^cacaatt&c 
qaaae«.ncbt^ 
aaftaaatttt 



gtgfigr;tcgc- 
t.tgaasjaca^ 

acB 9 t.cocsg 
<4ggant.tct.g 
accatgg^t.c 
atg-g^ngaEfs 
litgucasagc 
agtcl:cAc<;n 

geit-cg^cagg 
aqgaact'tt-bh 
eag-cz^c-cit-iac 
fltttsatacL 
ocwggtggnjs 
agat^cggctc 
gcacafiactc 
SJttacegiagg 
actgc^cggot 
c^tt:«»aa6faac 
actt.C4t;t.t<? 
tt-i^tgtcaag 
ct^gscagii/a 

bcccttaccL 
c^raaatgtac 
ugaoaaaacc 
aatgtarcdt. 

a^^aaaaaaaa 



Cjgccccgagc 
a&cg M.t.gcoEi. 
t-odgcacata 
taoitcrtcLH. 
99i%g09g«tog 
cct-gttcaag 
C(5Jt.9aa1_i5L<: 
acr^vbttct-t 
aoctcagaga 
cCcci-Lg^t-t. 
lirttgRtgatB 
caccccactg 
ccggl-.ec<?tg 
c«bcaagcgg 
ca$;jaac:asLf. 

t-ccQcaactg 
iragggcacea 
4:frt:ocbtBtcb 

caagatcraal: 
figh^gagac^ 
cctttgccac 

tgtttaattc 
atattaaaag 

taacgggcac 
atattttg^o 
tt^igoattoa 
cctctttgge 
aaaaiSi«i«anA 



£EA^agg<;r,^St 
cictgaziagcaa 
aagttagaai^ 
g»{9agt-'ttan. 

ccccctccea 
ctaacagAl.r: 
iig<;cggtgtg 
cacctaagaa 
tttaaggahg 
ggctcgaatg 

tgccficliggc 
agtttabgct 
acactcccgiif 
f:Mbggggciga 
acgt^rcaCTtt 
ttgctacaag 
Aa6«<9r^tgc>t 
ctQcggpacia 
tagtgagaci- 
ctg&tgaagc 
cacacatf^at 
atttggat-g^: 

aagcagtcta 
&4s4agc«aag 
taacaaacat. 
atgaaagd^n 
niigtttatict 
atgtcattiaa 

aaaaaaaaaa 



gccgxgtttg 

agciL-Hczttgtj 

c;fl5ctgE£?gg 

tctcgccrcfiU 

LLuMgagnga 

tgaaoa?Ttct 

t-caat^ggafl 

i^H9ptbbt!3d 

ngccttggag 

tgtatiiLani:! 

tw-t-cctgcaD 

acoagagtac 

gaagccgggt 

ggggccggcc 

cttccctttg 

acggfioijggc: 

j^cg^fctbaagt 

tCcctcagac 

t.ggg9<-«g«g 

gcftgtt.taaa 

t.caagaaaag 

ggg^ki^bgag 

actgaagcaa 

tigagga^ang 

gtgcctttuc 

acctgagc<rg 

gagAcQtccc 

gaacaacsaa 

aatccatf Kgt 

cctggotttg 

tctcaaacaa 

crcgag 



CMtttcctta 
ga^Ttgggaa^ 
gccrlJi<^agct 
coQgacagag 

gi-gcagcosjET 
gaagaaggct 
tcagaaz&agg 
gatccaggag 
tcccgcagat 
ctggag&gcig 
ag^;gtggaco 
gatttattgn 
cactcgactc 
^rttaccat ct 
ataccccggt 
ggcttccrtg^i 
ahgvagaagg 
ct^gtcaacac 
gaccacattg 
fltggtgtaca 
gaggaaacag 
attLggsoba 
gaaacrcctgg 
agaaLca&^b 

ngttacaggL 
tLatidciagtg 
ggt-agaaaag 

Ub^aaotaag 



120 
190 

30O 

J?.0 
4B0 
S4U 
60 0 

721) 
7&0 
840 
dQO 
?60 
1020 

loao 
n *o 

1200 
1260 
1320 
13BD 

1!SOO 
1S6D 
IC20 

i6ao 

J740 

LEOO 



<210> 27
<211> 847
<212> DNA

<213> Homo sapiens
<400> 27 

tggbgccgge ^tocatcgcc ttCJ^^oatgg cipcLctgcag etgtgbgttc tgcagc^tgb 
cgagcat<5bt catctgctcc accacgctgt ei^^*i<7«ai-.t. tgcaccgcga gbctgcccgt 
cetocacggg ttcattgcgg egcagtgt^g acctggcar^j acggsc&gcc tgctgctgge 



60 
17.0 
l&O 






tccttttctg 
Lcrccggggccr 



RCtttcigtct 
acgcrl.^krx^b 

ct caccaagt 
ccEtcagggtm 



cctggtacgt- 

ctcaQcacgg 
Q^iLgygtst t 
t.cdSGtgactia 
tggagcl.gt.n 

gtgctcacfi&n 

tcagaaL^MM 



CigCCCMggcc 

g*?t.tgcQgtg 
gggggi.i^gt-g 
gctgggaagg 
i:(ia@aaL<:oQ 

t-gr:i;;(ibccitc 
CC L £j gcji^t t C 

g«bccttctg 
ggcrragbgto 
acgattccac 



gtatggttgg 

gC-C Lc:cn;'ggfj 

gtc^iccetca 
tgi:gat^cas<4 

ttcaciicaijcj 

ctQFtcacagt 
Lgrigb^ctgc 
cacceacctc 



g&aga^:L-yijM 

gtsbcatgga 
tgctggagcri. 
bgg^^gttcgD 
gccaggfaggc 
agisnggncca 
gcccccag^ta 
gtfrlicAgcjggc 
aaaataaaaaa 



300 

4HO 

60O 
CGO 
V2U 

847 



<210> 28
<212> DNA

<213> Homo sapiens



<400> 28
ccacgcgtcc 
tgflOTRbctg 
tcaatgcccc 

Dcctgggagg 

gccgcaaagc 
CflLSeb-gbgag 
gaggagctgt 
bggtcggact 
ciagctggagg 

t bcsc:ftcrC5©t-C 

gagctctg^zg 
Gbgccgaaa?! 
gcggttgtLa 
cccnggbcct 
aaaaaaaaaa 



CBctijccggc 
ga^j£rt.t.gMc 
gccoRHtcac 
<rc:l:crl.l:tgga 
octg^rtgaat 
aggctt:r:bt.t 
ozitqaacatc 
ggt:^'jflt-gagc 
cai^cactacg 
^tgi^agc^gcgc 

cgaagattzrra 
ghjgbgafftCt 

ctgcat^aa^ 
i^Ad nctoaaaa 



agagcgcl.^TT: 
aLhggbggga 
■stttcaggaaL 
cLAgtcotgc 
ccactgcttg 
aacat.^!bt.tg 
Dsgatgatca 
CTftgurcatct 
tcagccst(^L 
^^^ggcitccRffi 
Cl^cctgccag 

ggtgr.aif.'gc^n 
gtncgcgacc 
ggaggg hy^o 
ccagtttaca 
aaaaa 



gctngactgat 

CtttLCfwgbt 

t-caccaatga 
ttatgt^gtc 
c«>ggtccct.t 
tiagt.gtcrago 
1-9<?bgggaag 
acccggggga 
btQcsactct 
gggctccgijgg 
ggcbffccgtg 
acafl<|fi<:tr?g 
bacgcctcc't 
at.c:9gg^ctg 
cttcctccac 



I .i^^ttgggceigg 
tgsjctataac 
g^ic^nbggcag 
cctcatcgtg 
ggcrt:Al*.i:tM^g 

3Ctgctr:gLg 
gcsgcoccccc 
gggcjiiLOgtg 
acttcsa-cagg 
cc:£:gg<:gi^cc 
tccattccgcgg 
rfcrgtttbbcicg 
ai^agctg cga 
^wgcM^cbgtg 
aaaaaaaaaa 



atcccgctcc 

gcccCTtactg 
tCtc:Lgf.^^Kc^ 
cb^ggeiao£3a 
ttt@gatt.ca 
Stoaatg 
aaggagcticc 
QbggQooac^Q 
ggagctggag 
atggc^i^Tctc^ 
cagt:gc<:;d1-y 

gf^tgicitt^dog 
^TEPagggacc 



120 
ISO 

300 
36D 

6l>t) 
660 

7ao 

500 
960 



<212> DNA



<400> 29

atactctgal. 
tbagoeittgc 
gcttcctlil-t. 
abcagagata 
tggtaacrl.bf^ 
atggackcagc 
aaaataLtcc 

tggaaccrrgi. 



aaggcrtaag}^ 
<:tt.bgctCTtg 
aaaacgcctiw 
ttcgcttatc 
tgccLtt:rtt.g 
bgaaaattta 
taagci^tgcc 
r^bcttcattc 
cagatjotgaa 
e>beiagaa3tg 



astgaacactg 
c-tg<|etbtt>t« 
tcagattgga 
frLKfl.t.tgtaa 
cccttaatag 
ar:t:gticiggaQ 
etcttctgaga 

ai^ttgatgteiK 
cktggctgaaa 
tagl:Uti»goc 



tgaA»tbtt:c 
tagtgtttal. 
1-9gciacgctg 
CcatgtLi:t.t. 
Mt.gattcbs$a 
baaaattaag 
^jrit-bCgciCfra 
cccttcca^b 
ciaatgttctc 
tgaaggacLa 



5tbatat.agA 
ggggtggaca 

ggttgd^s<rg<a^ 
Stggagacca 

•agcctcaaaa 
agaLlhg^ictg 
gttc>gt:ggat 
CtcagaA^jas 
l-.rjictccatca 



C<9btbb2iacfe«t 
actaagacat 
gcattttirga 
t.cagi.:£saac^» 
gag^aabtca 
tcsg1:Ll.<?ots. 
tggsTtagaag 

ataat^tgat 



60 
1 20 

lao 

'A40 
20O 
360 

4BQ 
54 G 
600 
tgagctttga ttL^nt-^ttg cctaasaatt liKtpttcft^jt catcttggaii ijl:<;tgt.gtat 660

i3a*m3i»t^yw.«g antccagatg gl:fi<jgptg<jtt ctatsgtccrl: luttaiiagcta actcttgagt: 7^0 

fjtcagttffaa tal:ct?r4hto<» att^ygatctg uA^^ataaTOt aaggaaagli^ ii-fcrtbg^topa 7 BO 

g«»;i?tgcM?r?« gntgcctctt aye L:g?s tags tcQcaggccr. ytigggtttDt attctccctc 640 

ttctctctgg aacatatg^s; aattccagat eft£i<?gociacia ngtuttttaa C:a;^n4cit»{99cs 90 0 



91 a 



<210> 30
<212> DNA



<220>

<221> SITE
<222> U\

<223> n equals a,t,g, or c



<221> SITE
<222> (7)

<223> n equals a,t,g, or c



<400> 30

i::^CT>tgiiatt ciatctiaiicag ft ftcaatg^tg tgagcatgaa gajfgcacaga caggtcftcca ^0 

aaatagatgt taggatttso gtgcrtaccrt-g acciCiSQaagt aggU;t(?.<sc?r^ c^tccaaDtoc 120 

tggggatgat aKflataatcia ptgacSJtats tatal.Atttg tcattttgca taaaatatLg 3 60 

tgaaaattgfi ai5gagg©<?ac tcagtaa^^oa tcctisggact aLtl.gtasgt tatoQcaaQO 240 
wflgatgaca gasaagggatt *igtcccctct gtatntutcsgt. tgtctcttag taacstcaaa 

ttgcagttati iar^^^iatttta aacLatgb^ir: aagctacaaa atagMtcbr: tbtMbaot^* 360 

t-stbtgagtB tgtaattLl-.t^ gttCDttttc tggutgbatt tgcggtagtc agatgLgt.Lg 420 

gattgatlicri;; <«ActggaGag agr.a^jggaat tccagcaGc^:: l.ctt.ootgct. tgctcatKtti 4fi0 
accocncaga tcasacK^ibc Mttctagtt g^ggci.tgotQ tctagcccca cacrKeLgaKl. 
gaagcctuaa gcacbgttgc gcr:l-.c-u:«tgt gctttggat.^: agcaaccicca gt^iEStattcti 
accogagcat tgtgggae^ig ceigatgtaCa gLo^ggtccc aayagcaaBC tgLl.gggt.gh 

gagagttcLA *»agt«tasjgg gtt^agggaog agaaggatal. g^<act.cotcb S£iccttiaa!5C 720 
cngcafctcat ttaaccttta tgtctactta aea^gag^uic Dtggagaaaa ct.acrcgl.^tl 
caagagatta ^boaooatica guglittt^igc caggcgatga i>cigag«agcci ccactcctca 
^icptc^attc ttgtaatghc tgtaataaat tLcAgbgcgt caggatggat gaactt^sii^a 
tccagtgaat. gabtcogctg ttce^aguct tacattttcrc att^attcatc atccattccc 

ebtcagtgta acctctugM ctattgtggt r.^AttbtQtc taaaaccagt C.l.sabgttttt L02D 

ttcttttmssa twbgtgccta tgli^^abaasg tccacacacrii ggctatctct ?Ttagaggtga lO^^O 

ggbtttgttt ttagttgUtc tcictgattat a l:cc?t b ttcrt ga^ctatgaa aiat.g«n?t.tat 114 

taataaaaAn bttttgaaca a^a^Qoaasct aaaaaaacl.r;. gag ll^.i 



540 

660 



7B0 

900 

9&D 



<210> 31 
<211> 2377 
<212> DNA

<213> Homo sapiens
<400> 31

aggggcacga g^rotogstgt tgtcgtrcot gctagtactc egggrtgtoCT gggtcggtgc 60 
gg^Udttctag tcatgaaaLr agggcaggga ctMcLaczc-.Bc. agcgacgcgg c:t-gg^?«Agi*c 120 
tgtttsjtgtt gcrgggggccg gacttriei«gg tgattttaca acgagjstgcc. Kctctccata 



1 HO 



ttcl:l:ai3k^4>rj:

tccc:l:l;i>C!l.t 

tgtbcscttct 
iracagaacat 
1 1 1:^1 tOgg^Rft 
caataaagat 

aaatgattgQ 

tactgaaaca 

coaaaacata 

Kcccagcttc 

agcacaaaga 

p-ggp.5c*j!2tc 

tsbcsnttaag 
aacttcagca 
abtagccctg 
gacttatgtt 

ttQtarggcc 
CC!&3bn«ats 



to.ctg'cctg't. 
at^a<rctc£rtt 
bbactct-CcAaa 
catictggg<irti 
bbziaek^Qcac 
ct:cratLa9<:M 
tbt-nbt-ETDtg 
caccttat<^;<-- 
nbcatemtag 
gatagacfatia 

Ratggagaca 

caaagcgatt 
acccrel:l_te*c 
tccoEctcac 
act.gai:::agbn 
<^ctaat£agtg 
tttatttccic 
gatctctgta 
ggacttt.bgR 
tgctgtaact 

ttttgcgttt 
cac'rtl^riMba 
tgccaatcat 
&acrb<:«iB.ago 
taCactagtg 
ag^btbgatt 
PfTt^ctgtag 
a«bgbbaata 
ctaaactarg 

taact.atgta 

ll.gbabgAQfS 
tctgttctat: 



agaagrt^sgan 

accagngcaat 
ctttcatcgt 

agag^gp^j^grg 
tgcfoatccaa 

a a t.<l& l-r;:^go 

tagbt.i;caga 
grate U^tcb 
gagctagaag 
agaaaggag^ 
ai5i:sbcogaat 
ctaaaacrcraa 
cagt^ubctQct 
boQCTacirttt 
atcragaacat 
gbgatj^caga 
aac^gg^bta 
tttaaaaLga 
ga^^jtaT^ctt 
tctggtattt. 
cra^gcsatcaa 
aaatcctcia^ 
bflcatcag 
Ckas^tCaccLa 
Lagggagagg 
tgtttctitccr 
<':<agcsbbtcaa 
faagcagcrlii^a 
l:g tb« bcaaa 
ctcagatiut^K 

Bfcatg3t.gl:ai. 

y^aagbgtag 
ggagtttgaii 
tgbastettta 



t:a<rai;;catct 
atnttagcaL 
ggi:i-wtagRc 
gccbs^^tgaa^: 
gtuaattasa 
atacaaggcrt 
<2ghi<-attgQ0 
baattctctg 

catctngata 
aaar:.^g?%ctt 
tg^gtatgat 
cctt^iirtibgb 

cacagac*&gt 
agottotctc 
sacagaat.cb 
aa^tgoaatb 
caacatat^cci 
tba^jstatat 
atttttL^gb 
^tcbgccaag 
ttcagt^l'.ctb 
b-atogcttgc 
gtcagiigctn 
g^gb&Qttga 
caacaaggt.^ 
t^btbctgttc 
cacttttbbb 
bgbbbctgaa 
gcaaaaflg^B. 
sbacccctta 
actQthwcig 
bgaascactg 
actaaL<jt«b 
bgbtaataaa 
cttaa^a 



tgactgiic^":^ 

ft^-ttnctttga 

ttccagctg^ 

Ccdbcgbgcc 

agacttgai.1-. 

sccjitagttc 

^tactdaagd^ 

t^nogatgaat. 

cctctggaaK 

^tcatbbtca 

cj^taaagat-i*: 

gbabgbgcca 

tccratgctt 

cc&gtgtgca 

ogtcaagaag 

agt-gccviagt 

tcagactatg 

fia1:g£9«r7abg 

KQtactgttt 

actgtaabbt 

attctacagt 

cttttggaat 
aabb-P?4gacc 
cagccac-cua 
gtLgi^{&{9bnb 

craa^goAgtc 
baacatttta 
ctLKrti:=t.b«Ci 
aatgctgcat 
atftflrtt.?4gc?L 
tattggattt 
taLb^t-gAcici 
eiatcaaaatg 
cal-gtbbbco 



gcbntbtgca 

aaatgcatct 

taggbbbaaag 

tcraccagta 

gbciabb btgo 

acaatgt:tg£: 

a^ctbtgckcat- 

tcacatatga 

cT7baccbciat 

tgatcacaaa 

Auvbbangeia 

tttatttgga 

a«-K«ytgca«i 

etgca^aaagt 

aaaa I .gti ^g b 

catttgggge 

aggavag<3 r;ga 

atiCTtcgtggt 

gatffctbcaga 

sattttttgc 

tLaabOiwiRb 

attcactaat 

gaAagtflbag 

tagatcacag 

gbtctCEcat 
tc'ctcrctalzr 
btatcctgac 
att.actLal:ag 
b-aatbttcag 
aaaaatacca 
bcttatgcta 
gaaagcaagt. 
agctaeiattet 
cLaaggat-LL 
Etctttaa^at. 



240 
30D 
360 
d2D 
48D 

600 

720 
7fl0 
BiO 
900 
900 
1020 
lOPO 

1200 

L320 
1380 
14^0 
1500 
1560 

leao 

1680 
1710 
l^UO 
1860 

15B0 

2100 

2220 

23dD 
2377 



<211> 795
<212> DNA

<213> Homo sapiens
<221> SITE

<222> (751)

<223> n equals a,t,g, or c



<400> 32
ggcacagtQsc 

gCt-Lgligggc 

gctgctggca 
ccgaat^cbrsb 
octgtatact 



agcrabcbacc 
ttcctcgtca 
ggcat.cbgco 
csttgctgtgc 



taaticcaggi. 

iKirtOCbOCCt 

tcct.gcCc:a« 
i-99gE3«igcg 
t.cccacaaT.9 



5«bcttti^gt 
ggg1.t!g^;cgg 
tgggj^tgata 
ttglictjggcb 
atccggcaga 



gci.gbggaco 
cctj^cccaga 

gcctccttca 
cagg^jAtggg 



tgcctgccaa 
bggvtgDflcb 
agbccattgc 
?*cb5CQbcibb 
aabgggcagc 



60 
12D 

24D 
30D 



accatggccf: gflLgtgu!jc«^T catcgtgagu ccactggtLga gciM-tgc^ctgrc c^agctctac 2611 

ccctcccibac c C etc t teat tttcicgatgct gttcctijtgg ccRcca^cgc t41--^;rtot.gbc ^a20 

ctcctgccag A^pccotggfj ccagccaci-.y c^co^aoacgg tg i:!ag*j«cot. ggagaacsgg d^U 

oflLsgtjgctQ^ic agacgcgaca g^:*iRc<!ii3Lga!; MMagaast citatcrgtccc t tjcauScc 54D 

ccagcaceag ag-aAgvatttsg actctgagg* <^tg^l!3aap^<^g gcrcrk wga accctaaa^g 600 

gfl3ggo??ggt cctacaggtc tccggcciacc cacacfia^gfl ggasciaajjag g£siiaLggt.g*a C&O 

cccaagtgtg ggggLl.gtgg ttcaggaaa^ ci^tcttccca gg^gl.<it:t±v^^r- tccctttata 720 

aaouwacca Qotaccacatc atlitiaoficot ttgactgcygop aaekaE^aaaaa aaaanajfltitJ?* 780 

aactcgaggg isgggft '^^^ 

<211> 2656

<213> Homo sapiens

<221> SITE
<222> i2C52i

<223> n equals a,t,g, or c
<400> 33

gatgagtgcc tagaegctgo naCgattgaa g<j»gaaeitt£i agtcLlitoc!© ttcagagaat €Cr 

tcagganOTt c«tgscGaaga geatbggttt actgaatt^c ca^?ctgcgtt aacai-.th-gp? 120 

tbgtcaagat ttgajibttwo tcaggcattg ggKO^jaccay aaaaafibti::^ caacaaatta 160 

gaatH:i^wci aogttttata ttLggnctaga tacatgcaiia g^aacagaKS aataacsaaga 340 

'attnogaggg aagageL^5(^fl gagactgaaa gattciQctcffi cggtattatrri ^^c^aaggcta 3 00 

gaaagatah-t taagctacs^g ttcc$gt^?co aaacgatccc wttggtaga tgttcttcag 3 60 

totgcattgg aatctgccrt^? ciagtaaaccc gtl.t^^;actt ctcctgttga ^jgst.attgac 420 

Sjctagttccc cr^cctagbgj? ttccatacca toacagacat l:ao<:TaRgca.c aacagaacaa A^^O 

Cftgggogccc tatcttcag^ actgccaagc acatcaccbt catcagttgc l.go-r.irtbttca 54 0 

tcgagateag taatacacaa aceatttact cagtcccgga bi!if7ctccaga tttgcccatg SO^ 

catccggcac caaggcii^flt Mcgaaggaa giaHK;t.ttctg tgctggaaag t-bgttbacat 660 

cgctggagtjei cagc^aataga aaeUg^cacc agagatttgcr «aggs«acfcat atccagaatc V'^y 
catcgaacaa ttgaatitctot- gtactctgac *iii<at.cbataa taca^ghhcc- ttatcgatta 
catgcrcgbtt tagttcacga aggccaagct aatgct^fig^ atytactgggc atatt^btttt 
gatcatcgtg aaagcagatg gatgaagtac! aatgatattg r:l-.gt.g«oaaa atcatcatgg 
gaagagi^bag tgagggaetc tbttggtagt tataQa^atg ccagtgcata irjligtttciatg 
bacataaacg ai-.a^^ggcaca gttcctaats oaagaggagt litcici.t«Mca aaccgggcag 
eccscbtgtbg gtatagaaeo attaccaccg gatttgcigag attttgttga ^g<9Agacaac 

caacgattcg fiisnaogaacc agaagaatgg geitgcacaac lit-goccagaa agctttgcag ia4n 

gaa&Agcttt tagcgtctCA gaaatcgsga gagtc^^gaca cttctgtgar: j£i dt^agcac^aa 12 DO 

gcagcaggag AcccagMta tctaga^cag ccatcasgaa gtgatttotc aaagcactcg TAfc^O 

aaacf^agaaa ctattcaaab nabtaccaag gcraLc?^catg agcatgaagti tasaagtcct 1320 

gaaacagttt hgcagtcgcc aactaagttg gaatatgcaa g^tbgottaa gttggcccaa 13^0 

geagacttcicc caccagaaaf: i:igottatcgt ttacatt^atg tagtggtcC.a cfcttatcc^ag td40 

aaocaggcac creiaflgaaaat tattgagna^ acattactag ^^ocsoatttgnf agatagaaaL 1500 

ttgaghtttg atgaaaggl:g tcacaacata atgAiafligtcc ctceagccan rtctggaaatg 1560 

atoaaacctg aftgaagtaaa cttgg^ggaa tatsjaggagl. ggcatcagga ttateggaaa 1620 

ttcagggana caactaUgta bctcataatt yggt^tagaaa actttcrAw^g agaaagttat IfiftO 

atagattcct tgctgttcct cat^rl.gbgct tatcagaal.^j acsaaagaact cttgtc.C:-a?^« 1740 

ggcbfcataca gagga<ratg« tgaagaattg at-«twflicatt atagaagwga atgtttgcta IfiOO 

aaatcaaatg agcaagccgc agaacrtt=btc gaatctggag ^ggatcga^a agliat*Mon?*t. 1B60 

ggttbgabta tcatgaeitgc gbttattgCf: or^^atttttgc cattatl:rti::t cgtggatgaa 19:^0 

atggaagaaa aggatatacc agctgtagaa gatatgagaa Atogatggtg ttcctacctt lf?SO 



7ao 

B40 
C|0O 

lOBO 






catgsactcL 
g^nagatoiaa 

CLtBGCttCC 

tgeuLisabccc 
tgatttLAi.i. 
ttotcttctc 
ttaagt.gl-.t.C! 
Eitflaa£5t ttt 



tggziQcosLc^ 

cttjcacactt 

atggcgctctt 
rtt.ggcaoeiac 
cttctcaacra 

atttatgg^rt. 



ii«gtttccat 
tg-:?ccg£iat:<;: 
t<^<:r?tgeiaM 
ttt-srcttgiiL 
atgat!:b<at.«c 
?9bc^ittteaa 
tatgcLhttg 
^at bt ccart 
acgae:iic«t.a 
oaaaaaaaasE 



etacfCcoccga 

cactgtptaii 
gctgctatiag 
□tgacttta^l;; 

CCgg^QCtga 

ccacctccfjl- 
cagtntgEiat 
atatttggi.rt 
^r^fl^actaaBta 



•rattttt tgcc 
agttaccCLfj 
l-t:agtcgaac 
actcctttta 

cai?actacag 

tgtaL4iMKc>p 

gbt-Qcaptat 
ccttccrftl;g^ 
gc btgtaaat 
aaa ai^^fiia ^ti^ 



aaaactgctt 

tccCgctgat 
gl-hottapcc: 
ttttttattt 
a caa t r^^aLjc: 
c=ci«c» tctaat 
tactttgciiL 
cbacasjccat 
gaaa t:aft.^ga 
Maaaaaact 



2100 
2220 

22ao 

7.i*0 
2400 

2520 

2640 



<211> 2566

<213> Homo sapiens

<220>

<221> SITE
<222> f2£i.^)

<223> n equals a,t,g, or c



<400> 34 
gcaaatagc^fi 
gttaatgbtt 
bcgcHcagag 

atcccacctc 
bo^taaaeigc 
CggtcctliLt? 
ccttgtcgga 
cgetcaceci:*! 
gc<itggtatt 
ttacacgcca 

gtcratcacca 
atbt^stgct 
ctgtat^iirnt 
ttaagtattt 
t ttatcag ijt; 
t.tyatcctac 
accaaaaggt 
acatgcttac 
gaatgtLggci 
cagtbagtga 
tggagcft$<4a 
Qoaactpj^ac 
cl:ggai;c!tga 
ggaaaaagga 
gLgicjOeogag 
catatggaaa 
rtALgg^fltgc 
gatatattiga 



A^tt-cagtac 
tgttcttal.t 

€jggaactci:ti 

^^agtocccD 

ccagcgal.gt 

^noDacMj^ct 

ccrajficci:_t.i*cj 

ggatctccsc 

t.cttacaa&c 

tnoantttac 

cttttliaagt 

tcotccaatt 

alitircRcatt 

tgcctttttt: 

ttg^aflMba 

ctttLctctgt 

tgeAg«>pngt 

yctycggttt- 

n>ci:i.cgtgofc 

cgatttgcacr 

tgltggggtC 

ta caeca tgg 

QQtcactga^ 
9A«igccaaag 
saaagaaang 

t.agcaat.g(i^ 
^ciAabgiX£Taa 



a tea C. albeit 
bgaattattt 

C CCCg ft ^^iag-g 

t<^acicaaggg 
gcaaa^ittgaa 
cccctccacc 
ettii.vibggta 
tcatccaetg 
ccagpioacaa 
tttctaecte 
tcatttbatt 
ctsjcaattca 

ataacctbcc 
tgcatatttc 
taatttLgtt. 
£itt.g«.attct 
tcattgaabQ 
g^tcctactc 
caaggty^^g 
zsj^gtcioa^ag 
egcctcrtgtg 
ggbcccaaaa 
aaacaa uvitj b 
gctbgc^agntg 
aagt.i:<:!acta 
gcattggaga 
tgagi-j^bagc 
«teictatj^aagl: 
aagiganggtj;! 



aaatagaaaa 

tair:tciRcttg 
accctattei^ 

acaagtgati;:: 

cagt<:icaagsi 

ttatccaLhr; 

gbactgcccc 

tgacatcirgg 

C<m ctggcta 

tttttaettb 

utbttttttt 

tccatcrtcict 

acccctggc£s 

atatftnonat 

gtg^aaatag 

cacalitgbcck 

otgttggcat: 

agcLfcocaaa 

gsccaagattt 

Bt£jt«attgt 

Qbf^aaaagg^ 

Ct^gggagCt 
CQCtjIltttLi: 

gi^gatgaaga 
S^acinaccat^;:^ 
nggtaCiCtCt 
cagaaaaar^/1 
tttttabaga 
cagaaacgta 



tttttgtttL 
ggtbtcecct 
gtgcragtar^L 
agogtaccag 
tttcacLLU: 
«gstctttct 
ctcttaggat 
ggtctttctc 
atagc5-i-b^t. 
tattaagtta 

ggLa««ittba 

caaaaagatcr 

accttctaatc 

ggaacatata 

teagbtgattt 

ataccactca 

tgg^atttat 

gggctgtaaa 

gccgtccbgt 

tgtgecLLhfc 

bat«gaatgt 

caatgcrlJ.Ur; 

ngas^gaagaa 

agaagtabcu 

netascaggta 

geatiLatcstg 

toaasagagt 

ctgagafctQa 

tngtggtata 



tgcttagatt 
t-is.^cictgttc 
tcttaaagtc 
ftt^sgctbttc 
tcagggtacc 
ti;;AOctbttc 
tcacc'ccagg 
atckocaccam 
cagtccccrtg 
bcaagtaaaa 
ggttgt-gLfciit 
gagttgtaca 

CCt.CtfitgCOCS 

tactttgtgt 
ftbflt-tbggtc 
tatctatgtc 
gft£)M^{9gt.gg 
ccctgctgta 
x.'i:gagggacc 
cccactgcta 
bi:!«ciaagat© 
gatgtttta^: 
bt-gtcattija 
gaatataoc.'A 
aa^cbaacoaa 

ktgktaaagt 

ea1ig/9«gctC4l 

agate tctaa 
gti^t.gctocQ 



6<) 
120 
1^0 
240 
300 
3G0 

4B0 
f>4n 
600 
^60 
720 
780 

900 

102O 
IDHO 
1140 

1260 

n32a 

12BQ 
1440 
IBOO 
15&D 
le;2o 
168D 

L740 
1800 






nttbgtgtaa agtagatggg ggaaaticttat 
ttctggaagg clAcflitMgw actcgataa^ 
9t:eb<aaggga cacaagtgag agl-.ot.tttca 
aaaccaatat t.at.t:L*it.t.f:i3 aaaaattaais 
Lf.tt^itsiib^ tt.taaaaata att^nrttcittt 
QrDcaciat.5gg aataagagntk baptaataac 
Lcasianntgt gb<jzRt.€igctt gagi-.r^tggijt 
gbacQCtcaC ct t tg tg L l.jy ogte^gcacaQ 
ttgtatcrAgc tggtcttgag cctLL-.gacct 
actgoaatta taggcgtijwg ccacaatgcc 
9LAtt:agt.t.<aL sccstgtttac t.cac:gt.RQaa 
cata^Qtcat ct.atascrAc:ii tataataaag 
acLt^gagggg gsgt.cc«?at.a cccaatt-ctc 



aaat^writbc cttgcatati^ cftt-iflaaatgt 18G0 

attgottgcc t.cf1-<r:<?ggQac gnaactgaac l$2l] 

ttatacgbgo ceittatacct tt-Le*flflt.ttt. 19^0 

aabeigtcttt taaisl-t.^oaa ataaa testa 2040 

ctccatgtsob tbsjaaagaag gggl-A^sPMa 2100 

rraocnttggc taaa/^^i^neiA actKtgaatt 216£l 

taagat.cri:^»c ngaattacat t:ggiiv<9Datb 2220 

^t^aba.teuis?t gggtLat^ttc te«iaaaaaaa 2i!a0 

cctgatctgc: ccgcctca^rc ctcrL^U^^agt 2340 

Lg£jt?cacati: tatgLjabttt tttatattct 2400 

gtttcccaov Atgtcttatt at^riutitggt.c 24C0 
La^;flbc©ttj? ctgaA/aflRM naetaaaaaaa . 2 550 

ctna<ratgt>» teg tat 25 6 & 



<211> l^a^ 
<212> DNA

<213> Homo sapiens



aattbc^goflc: 

ttacaaycta 

fitttbtt©«i5i 

Gccaagsfttt 

bbci«grgggci 

gtggaaggag 

tbccagctct 

ataaagcff^LU 

agtcsatgca 

gttttttttb 

tcbagttgtt 

aatcUgg^t-t 

tgetaatcccc 

tccartbgt.t 

attgctgttg 

taattiitgfct 

adtttttcct 

tctateL^tr? 

gcctgoagta 

agctaaatgt 

otttbDtata 

ettgctgccg 

oggteictctc 

tgkcatgg^a 

gobngcaaga 

gttc:aa«iei»c 

tteigagataa 



et-cccataaaa 

tacjitttww 

tgatccctac 

Lgtitcoatbt 

tCTt teat tag 

Sggctbjgtta 

jjnggcctugg 

rabtagttag 

aggcfegctgg 

ttcccttcac 

gttatfCbAaa 

gtcatsgatt 

t ttyt-tiaaRQ 

bacaactaaga 

acat^ttaat 

btaggtcttt 

aaccLac^agb 

bgaatatgaa 

taggagggac 

ttcttacaaa 

aagfjL.att.tbc 

ggagactaca 

attl;cbc;^nga 

tQatptigact 

acL^gcagta 

ttcccagagi: 

Lbatttg^iaa 

actgcatCigb 



Utgtacaaaa 

gttttttaac 

catbttttaa 

ctLcaattctss 

ggkgccngct 

tgagcagb^gc 

tcaggttacg 

gggcaCsac«iD 

tacctttcag 

gltl-.Obeiagac 

aagsc<?t.tgt 

eLttabtct.t 

tgggcaaagc 

l-«bgtbacag 

actat.tgct9 

tgtttgaaac 

tttlictcrtg£i 

clibDtaggiJt 

aacgtcaact 

t Kbfl?igociaG 

tcktattttt 

cl-goaacatis 

ataagggggt:: 

ggtoaatetat 

ccaagaaaci; 

d£ittaat.atg 

tttt.L4tt?agc 

tgttaaaaaa 



ttgta^^fligta 
Qscttcaaag 
atgatcsccita 
gtctatgaaa 
gggcrat.batg 
atgagaaaaa 
td-cttcasTt 
cgcagtactt 
gbcigbacttg 
cagctgag^^s 
Ubtaaatttc 

ti^tssattcta 
agggaaaggg 
ttgacatgtt 
agtttacftgb 
gt-Aoa batgg 
catgttLAtb 
^i7tgcacgtg 
tatttta^ttg 
bgtctctaaa 
gataaagtg© 
«ittcctaaat. 
tcatci:t.btc 
ntca^ipsaaag 
taaagccotg 
ttacctcatw 
aai^Roeiaaaa 



cattt.ta&<::rt 
L^^agtbtcag 
ccaaaatgga 

t.tgargbcta 
agtgtgttLi. 
rjsggottggct. 
atcgtaactg 
-cttaagtcat 
aat.Lci&ggt.tt 
gcaaagttgt 
tgt.gat^cct 
btDtQaaagtt 
caU:atttcit 
tcgttgtttt 
Lain.g«tggg 
ttttatcagt 

ctagtagatg 
c&at^oc^aag 
ccgtttgcat 
ttiagggaact 
atgaaatagt 

ttgaatttat 
gttsatctgct 
taactaac^^ 
otgtgcccag 
aac?U:gt.« 



tabtXgottg 

acaccaacac 

^ggotggtat 

atgaccacrtc: 

tt^^abssgkca 

ttgcttcttt 

tciQbrjtggag 

cttgatgaca 

gtRttRttca 

accactgagir 

aagaatttca 

acraLLiiit.i&g-M 

occcttccct 

ag l-A^fl^it.Rcb 

ctgtgttcca 

ti-.tggbt^Ot'b 

t gate tact, t 

atgat:!t-.bci;&^ 

ctti^aaggga 

ttLtf:Kfl^g*> 

gtccaggttt 

agaat^c^cttRc 

ggkcaactat 

ttflbagooc^t 

actaaacaga 

aggo fcttgct 

gtcagcrl:gb« 



60 
1/20 
ISO 

3D0 

420 

B40 

«>on 

€60 
730 
7S0 
B40 
900 
9&0 
102O 

loeo 

1160 
1200 
1260 
1320 
i:^»LI 
1440 
1500 
15&0 
L620 
16tifj 



<210> 36
<211> 5B3
<212> DNA

<213> Homo sapiens






<400> 36
ccgcccuccc 

caggaatLcra 
acetate bat 
gggstactgg 
ggffckccccttc 
tctatcttca 

aeiccaa.tccg 
a£iaa9ft«iCiQAa 



gc<rgQGcrcfJ9 
rsttcctcacc 
gaagccagttc 
grwctattc 
Ccsacratl:a» 

acctgg^atga 

tagcgt-tga^ 
g r-^oc-Dtggn 
ciratgattga 
fiotstgtgog 
cigtctgtgct 

ct-ccaaaatt 



gtccggaatt 
ffgg<^tgoctn 

cic^tcjagcaaac 
ggagfli-^;agc 
bgcttatcca 
gtta I. C;ggQg 
f totgggaga 
agagaLlii;^tt. 
ttncctccte 
ccacackoca 
c'Ktctcatgg 

ttCtctLtgrt 



Ktic^gggtcga 
gcctctwbt 
^tcDcggags 

acigoctgaat 
caatgtca^o 

ggcaaggoa^C 

ccattttgtrt 
gaiit<jccict:a 
gamtatcac^Ta 
tgfttcicstgct 
gcctgcggci; 
ataiagagcat 
actttgggcL 
tt^fiiSagtct 
agttcccaai: 



bccetgtccc 
ggf^Lgi^bcDg 
tcQag^^cijaa 
ggi:£!c:ti*fcac 
tgatctBtga 
tgc:iLLt?c*3b?i 
nggtcccaca 
at.crr.9£)ncjgc: 
5Lcaccacctt 
ttgage^g^:t 
ctggKtatca 
tttc^Ciag i^go 
gtgctiaagtc 
acgLcacggc 
azicteit-fsaaaa 



Mtctacagc 
ac^tic:« tc^gp 
aaQ£3caccet 
atcLgt-trttt 
t^accottat 

agcccta^cgt 
ctCUggCyga 
ggeitgtgtat 
gccaLgaagb 

ttgtcttggg 



£0 
120 

240 

3i^0 
d2D 
^80 

600 
660 
720 

840 

S60 



<210> 37 
<211> 2351
<212> DNA

<213> Homo sapiens



ccacgcgccc 
gayotffttcs 
CDagagaaca 
Lgtit.tfcgtgc 
acctttagta 

ttcs^ctgtga 

caaccbgncc 

tagatagttg 

aatgcitatgo 

accggtatcc 

ctttatgttg 

gtgtattaca 

Cgahtggbat 

taataaacaa 

ta&aagaa{r£t 

tttttBFtttt:: 

tfcgctttgct 

tasfaagctgt 

ccdttgctca 

catgctttgei 

aocsaagtasg 

aggatcra^go 

ccattgcttc 

t.gacrtccFt:<^a 

t^ggagagaa 

aac tgtgcaa 



ggcAgaagca 
taaagtcgcc 
ggijcbocroa 
gaaaaoatgc 
gcggtoatct 
aaacnggagg 
cbd ttcctac 
tac a3aaac:<r: 

agcttatgta 
a^^bctflpgta 
ctg tccacfcr^ 
gt.«^7acaCTtt 
at.at:gacgnc* 
<::>atgbcacag 
actgagcooa 
bttttaattt 
ggteaaastt. 
ggccaccgga 
t<2rat.t.AQQcei 
tbcctttctl. 
S<r:<?t.gtt5jja 
tcaaaatga^ 
ttcfcatgoca 
gactacacrch. 
aaaagaooDt. 
i>t.cfcccacatt 



gcagcatfOrtg 

Cgacagcttt 

t.t:acaatc;tb 

cbtcjcacttg 

acctiggtgtc 

ttggaataca 

atcar^bgtgg 

Robaataagg 

aaa c a b^ccg 

fl.tttacaact 

ttaatTCbtc 

bgggctatgg 

gtc^geicctt 

ggssaacttta 

ttybttgcca 

atccaacctg 

ggogtbtacc 

ogcgttatLt. 

cbcccggatt 

ttctcatwtQ 

gccatgtggg 

cggat::flgtccfc 

i7»bAcaagtt. 

ccttcaccoa 

4iccaDa£3cta 

tcagatfifi,F3t. 

atal.iji^ittac 

bgtcaacaca 



Qeisa{;acagt: 
ttcLtM-gbag 
tbci^agatifrt. 
taf?c:bgcogs? 
aatagtggl-t 
C!ioi7a«iggct 
atgatiattgi; 

at Lot bigger 
gaattgcaa^ 
ttabgggatt 
aagccaaagn 
g^gangbatt 
tcaccaccAb 
g t^tt ttcaaa 
tgtatngbcjt 
bbggcaaatt 

i3:tgaaaagca 
ttal:l;43tcfcc 
nekcca batgt 
augtcbcoga 
acggacat^io 

tcggtcacta 
agjiab^totga 
ccgtggatte 
ctggtdbucc 
oaaatagctg 



gucggt-ccag 
ta^gcgjagbL 
tttoccbtpc 
aac t ggag r^iLT 
gcggttcccc 
tggttt.^ttg 
cwiccicttagt. 
atggCa^r:^l.t* 
ofcat^jtsgat: 

toaacagaaa 
gctgLbtagg 
cgctgcttta 
tgcrttggRcb 
cctgctcttt 
litrtttgbgtei 
atataggcaa 
<:'»cgtggirzita 
CattgagAtg 
cc&eiiiaagca 
tal:l.4?g«gat 
oassgaaaaaa 
aCcALci^caa 
ccaagggtitt 
tgaaatocbt 
c!bg©QoaRta 
Qtggattt.bg 



gaggcggctc 

g^onnetac^zig 

tsa^rcggatc 

agtggatbcR 

tatgcrgLgUc^ 

cstggaatctt 

gcatbaHacra 

bttci<?act.tt 

ac'Ctgcagag 

b4itctaact.a 

catLt-c>rii^-bc 

tgDsaact.aa 

atcrl.gbgcigc 

tacttggtta 

t.abtiQ>ngt.nc 

aagctggtgg 

&o«gt«gfctR 

tggcaaaetg 

ttcctcgotR 

ga£sgagggc>t. 

gatatttctg 

Ll.gl-.tt.t-.cog 

gatgcaattt 

ggfio/Acactcj 

agt$9at:act.a 

tgg 1^/^/9 19 gc/9 

tgcttgggac 

a^i^in^b^^^t-g 



60 
120 
1 BO 
2dD 
300 
3S0 
^2D 
dSD 
540 
600 
660 
720 
780 

900 
5*60 
1020 
1080 

1200 

L320 
1380 
lldO 

:i 500 

1560 
lfe'^0 
1680 



cacttatata i.gLnjiaytttb ctataLcaaa A^teattg^gt ctfiartttboc tacacttaj^a 17dfJ 

ctbgatbtct t^aacattagg ©batcgcata ctcaaAl.ygt aoKcaatgac orcfja^i^t^a?* 1600 

tcttcctgat gLL#iu«»ct5c ttcancaag^ c^gatg^actit ttliLi-ttbtt cjacetcagacs 1660 

ij?&gtctbgct ct^jt esc cca g9^:t59««Jtg cagtggcgt^-e otctcsjggtc a-ctgcrata^cl. 1920 

ctgcctccca ag L; Lc?^ bgw attctcctgc: ctcagccctc ccaAgt^^got g^DKc^tacaK 15S0 

gi::aocJtgc?CQ ccatgcccag c1-<aattbttt tt.tt.tLr«?agfc o^jagacaggg tcrlc^rrc^Al-g :?040 

ttagccaggfi ty3t^;ttgRt ctqa cetera l. gubccgccga ccl.tsggrscto cc^inaijtgct 2100 

g3«iat.to<:^ag acgta^gcca d.^cgcctgc; ccaaga<ahgg ooatttttHa aaaaAai^fa^Li: ?160 

aE2ta.Dt.tcct. accactgctg catciagtata Al:gctcoc£3a sttatceiij^a^a i^goataatgc 2220 

ag&aiii:«9<?gn att^atgcjaa cttant-cotg tgccatat&a gcbtiscctsia caaacagtta ';7:?|^n 

tatccctatt cctcaar:Lgn ntstcttt^ca at/s^Qteiaga atttatcaf.b t-noaaaaaaa 2340 
aaaaaaaaan h 



<210> 38
<212> DNA



<400> 38

c^cac^gcgtcc gcccacgcgb Dcg^aaatac tart ti oat t^ta atgaaaagbb gb^at.g'ctt^r £0 

aagtgctgal ti^gbatbccig actaaesgfcat ataaatgaaU HacacsCtttc tctctgcftga 120 
gacbgc^igca tgaaatci:^:^ tgctacactg acLggtgaca gtggtt?ti-.t.fl ttbcstagaka ISO 
ctttcttl.tc2 tgtbgttgaj^ atct^tgctg ttcigtg^tgg ttctgctttg gcagttccca ^240 
angtccccta caggac&aiy« atgatgagtg gggat-abasa tctcaatl.tt^.- SLgcagctcct 300 
cactcacagg bstctccjatg gaagticitbijc gtcttgttg^ gcctgtaact tct etc tats 
Unctgctgas? agatgctg^^rj: bgtgagtg-cc ctgottgatsi tccaggcgcl. wgggrztaagg 420 
acctctttgL ggRabagcca tctttgtbbg cinstctgtgc aiil:.t5tgfcat ^rcctcfcagtg dBO 
OflStgcctgc Caaj^gcttLe *«i!incbgtgg5f caagaribgtw acaati^cctg CcBct.figtg* 540 
agagacacag l-i>ggbgaggt gagtatggeit. tatgocaagg aaisgtt-tbct gsctcacja^a 600 
c:tt.tflbc:cts ctecaggaal taactccatt gatcaan^ac acccttaatt gggaCiggg^c: {160 
tcgpgggcaa aUttcataka tgattggcag gactctaaac CgLAtagctt btctagaggi^ 720 
cjattttggoa jaCggggatLa aagtgccaaa tgtgcAtacit otstgactgg acral.lrtJK.Mn; 760 
ttcacagaaC: iTbatccitafflg gaaag<:?at.bg tocEagtaCa cac{»«iaagg^a t^ttcctccg MO 
CSiCDstaatg tttagt^ttg ccatcacctg gggob-btcttt aaaaaaggaa ?^agtbacaba 900 
aattccagta ii(wn?c«itaca gtgcfa^tntb ataaagctgc t-gangeiageit gaagtcaact 

ti7t:At?bact attatgg^nt gettggtaaag aA^ittatgka caaaagtt::ac n^fltcascob 1020 

gaatagtgbg at<?ctatttt caatfttafcat rjtgtgtatl.g «gtBcatatt atgtaaaggt iU^JU 

btabatgcat taattttggg a^gaagaata tMaabgcta atagcgatca tcaazabgcta 1140 

atagtgafcte tgtaaagacc ctcatttbct acttcctaob tcbctcjtatt gtttgaactg 1200 

bttataaagg taaa<n?ceitK ^rtaatttg^g ct.ggEt?rcag tegcrl:v^?t.gc Dtgtnatccc 1260 

agcacttbgg gacjgocaagt ^uggtggata tctcgag&tc agbtcitttae gatgagcctg l 

accc*acacgg tgaa<it^ccfcc3 tctctactAfe <ao.abacaaaa attggbbggg cttgatggtg 13aO 

tgcacctgtg gtcccaacca c^ttgggKagc tgaggHigggft oaattgcttg aac<ri:!?iggcig 1440 

gbggags^tcg cagLg^gcbg agattgiciact: {sctgcactcc ag<::<>1;ggatD atstaagcaag IBOO 

at tct etc be oaaaaaaata aaaoaaaaaa aaaa 1534 



<210> 39

<211> nie2 

<212> DNA 

<213> Homo sapiens



<400> 39

eigabtogagt gaL«iatbctt gttcttliijtg tcittcattta tacragot^ctg ctccatggac 






tactcatctt ataataaagg g^aLtigciijaci? g?rcatgaty« rjig^it.gtgcQt tc^rcagtgtg 

ctagctgr.gg r^t.cbaccoct ttttctctca ct:1-/aft5aaaa cttcccaga^i MK^-oBrt^Txvtg 

tgacsgcatt ttcOTccagg ga^jn^^octttj aatt^tgca^: m t.<jtwaeit.cc atagi^satct 

tcagcacttt aLL.at.h*agi-<i tco^rattctc tgULgnrtott aatattattc nujt.tb^tt.t 

t.ctjctbfrctc astgaaactt tcttr-v.-ucs^b ccittcccaag u.gttgbijtt tcc3?Ttaayck 

gattatctgt cattycagiig ti7cc]tjcatcct ccc:&^lcBa[jc cDcatgcgca tra f;r^t.< : 

hcjtr.-aggc^nc Dccb^tcato taaggcat^gt. tgsgcctgcc a9fi9?;ggcac cccttettcc 

aactttcagg tttaatgcrj: cjeig«iacatt.c gaaggKtcjtt gtctggaaaa gataagLcji.!. 

tttatabttc tttgaattct aggagliLgt.c toccacaaca aal:rt(3?^ctas atcacacttt 

ttaagtccaa tacttattal. c.-rjstoat tctg tggaaatiivnt atactttcta ttiaaL^^^ab.ij l. 

acafctt^atngt. ac^toeittotc^ qrs^ccacttLg gctgaacaca gtLLtutgct bagsjcttaca 

taafctaagsjt tgtaatgtta iittotg^jatc tt t gag gi-^fl.t. bactaBa^rat cracL^AhgAn 

gtaaacLgaij: Macataoct ectcttetU: ggaa&asjatg gal.gctgbct gctaaactaa 

tcaa^ttata gagcctiiagij ocggiitCTt rjt cggcfi.^^ptgc cCKtggtccc: ggfi>s^tttgg 

gaggir:t<a<3g5 bgggc^jsjatc atgagguridg gsgtttgaga Uriil.cc^tggc caacatggtg 

aaaccccatc tctactaaar «rswoacacac acacACdcoc acacacacac a<ruraow.cflici 

gccgggcalig gtsBtasg^sa tctgtggLcrc: r-gg-ct.act.cg ggaggctgag acaj^gagtgt 

oacttcaacc caagaggcag a^jgbfcgcgirt gagcca^^gab catgccattg <iBCft<-^v^Kr.o 

tggaagcct.!^ ggov%<)baga<5T caaaact:c c^es tctcaaaaaa aa 



<210> 40

<211> 1841

<212> DNA

<213> Homo sapiens



COTCccacgc ^tecgcaccr, 9tocc<?tctg tgagctccigt acbsrttctCG gtccctgcaa 
aCaga<:raLg£i tgbci?*c^?^itc tggaaCcaLl. gbgtacsjcct ctg<rta<: C-.o<li tnocatcots 

cbttgbattC taatcacbtU ctAtgcbtccc aCcccrttcrli?* ttcatotacjjg cagagttcgt 

gttttattca UbbfettQ^rca tttggtag^:* tbtogcacta atctgiiotrti?^ «t.«atgaatg 

ctcafflbanac attcgtcta* ttaaaotaaa aoaggaggt-c aggCcatctc accttttccc 
ccatcacSH** ctgoccttaa gtcttL^itscb gciacaj^aaat tagca^^^btg aasbaaoaaa 

ccg^auCTtjjtt agt.agL:a<ri;;a reggae tcttc cactLtbbcia octtgs^caat gggaaacat.r: 

ctgggggcr^g agcibgccasja gggag<-:rtwb gOTsaccggg caii^tstgac basgasjacac 

cgagtggt^a caaacKtcc?* caggi^tcaca gaf.g-Litggac atgataaact ttgcttcatg 
aeaaftbbttg cttcatgaaa atgcsa^tbatg cattactttt. <jc?atgaaba5j cCaaattgaa 

cijgcagaata craliLgtTCca cttggttaaa Lstgofcaaaa ggagattagl. ggocbtgaat 

ttsbftabcjat ggatgcacac caenngggaa aagcactt:gt tc:cttctacc tcgtcactag 

tatcagtttg tgetfcgttac ttccaataga a^ibgcbtcga aaga tgatj^M;? wagggctcca 

ar:«tg<si:?ct tctaa active gtKttoctgo ctgttcaeai* taatcctgca gcttcagatg 

CattgacCtg gat.«gflagcc aacatsaatc dQiacsagtcjtc c<7tgaaKaa*i Act-gaatacb 

LCi^c^^obcRS bgcccggtag crcLgtgtgtt ^jgagggat Lg gc^ggcaoct: G ctctgctcct 

g^tttctgct gttl-t*>?jtgc agagatagca aczagtaacac gactaagtga cccit.ggcteg 
ggaaa«^gr?i7 tctacattggc aaghgtgaaci ggjagccaaais tafcsGOcaflg caCggcggct 

oacgcctgta atccr.a&vcsci bttgggagga l--9*»gstgi5£3C ggatea r.r. bg aggtcogcag 

LCt::gacf&cc?a goctggccaa cal.ggtgoast -ccccat^rtr^b actaaaactq caaaaattgg 

Gt^ggcgtgg tggtgggtgo cCgtagtctr: agcbactcag g2gc*ir:Lgttg«i c«g^ra<aaatc 

^scttga&ccc ggaagacgga ggt-UgcagCK agceaagai.t gcaccactgt actccagcf:!. 

ggj^taacaga gL^mgootcb gtctcaaaaM aaaaaaaaaa aaaa^^^gggg gga^ccaaac 

tgtgbtctat agatgtge$^r ribgogtcjtag gaftgai^i^ttt aatatttagg g?^gaftQaeitg 

ttagatatat ttbbtttacac tcctCgligaa oaccggcatt a»i bggza tapig gaaccttggt 

tttcggsgct ctctgggliLi: Lggcattcaaa aat.tfrtcbtgg ctgggtgcrij» tgcbc«cgcc 

tgtaatccca scactttggg aggccgaggt gggcagatca tgagttcagg agttcaagac 

cagccbgacc aacatggts* iSAOuocatct <r;t«^ct?>«att« taaaaaaatin attagccagg 



ceitcgtugcg jjacgcctgte ntccscogctiQ ctcaggaggr: bgitg^ccujca caatcacttg 1740

aacccgggag gftggaggttg cagtgagctg ii94Jttgcffiac attgcatcc^: As^r^t.gggtg 1800

pcagwgctgag actccat-ctc ^^tf^^oaactQ aaaaaaasaa « 1841



<210> 41
<211> 1197
<212> DNA

<400> 41

cccac!?cgtc cgatLggg^d. oacqjctgtce tta^tcoctt ttagcagcag fisAftt.titt.tta 60

l-.tt.tgtBtga tgtcactgtt ccaLgtt^aa garatcatggfs giibgtacacwi acgtactgecr 120

cttacttgct ectgL^ottfli gbgaCgtgtc ac:til;tt5C?iD caaatecafi^ jsa^i b ta«g 180
*atttg<?Pt.gtcs cattflttttg ctaliwi^atgt tttatttcai. tt-ctsctcca caatttcagc 240

agatgctctt tcattcCgt-^ji tattttgcta t.gcjftci;Rca?3 accctcattg i^cotgtattg 300

9aftc.l;i^0t«ifl caccaatgca gtgr:l:.ut;«ag teitctatgafl tr.«aatggca gtgttcacaL 360

gctttcctcc cataacLtaib saagcagsag gLflgttttct ttcccatca^: njab^igccatt 420

cttttcf?LL« ttbttcata<T ttattttctt Qttaagttat. <r>tgtaiaaaat: aatgcatctc 480

bgtcGtctoc tagcaggcM ttgttogagg ttaa^atcwja aattaagaag aagcr(i?»gr;p<> 540 

ataaatcagA t.t: bgsaaaca aagtt.£*gng/3 tttttgcaca aatiatpflbac ttacaacaRt 600

ttcatcuiaag ccaatttatL ytgijctactc ttaacadt-bo ctttaaagtt aaaaat-.trtr;t. 660

ataggcgatL l-.gti^tattat tcactctliug ttbttataat tttgl.ttftggt ttcbtttatt 720

TOflffttactt tatgtaaatt: auttggaatl: ttatttafitg aeiaatcagat ttcatcattt 780

ctcccccagt t.tbcta©^?t£3 gctrttgai.Ub ttgtttctta gcttgahUgr; btgctagttt 840

ttaaatgacfl taaaataUag ^attccfltgag atgcacag^b cbtgagtgcg cagttc:aaCt 900

gattttgaliA ftatociaccca Cgtaa^a-.gijc ngctgaatca agai-aaag^sr^ cattctcstt 960

acbccagacc ttcccgiig^-.t bcagtagctt tgtccfivptg tt^iggcagca tgtgagacga 1020

agttaccLei*. gccgbaggca attcr.iib^tg attctacate gtiasl:^5<i?Skta tggtgataat 1080

gttacttLtca tcagaagg^^t caaagtaatg gaccte^iaao ijcaggaaaaa gaaggggtta 1140
tccagaactt oaagagaact ctcl.cfAwaga ciagaaaagaa gga^?^iacp«a aaaacieui 1197



<210> 42
<211> 602
<212> DNA

<213> Homo sapiens
<400> 42

Mttcggcac agkttgtgtfc tctinatigtlic! onggbccggc caggetggca gctcctgctg 60 

gtfi^bgttbt cctcstgtg^r bgttbccaac cagcCf:l;tgg bctggtaccc agcaactgcc 120

ttagcagaca ic^^cmcctot agcacrcl_g^ cgacgiiatca gt.ggiit:«tgt- gggcntcatc 180

ttc&gotttgb cacacctgga i^agcaaccga ttgcUggctw c^agyttcaga agaccgaagc 240

gbtcCTtatct gga<^ggbsgg cgacct gcgw gbgcctgggg gtcgggi:g^?5> gMtattggg 300

cacrtgotttg ggeacagcgc ucgtgtgtgg caggtcaagc? btctajiagaa ttaccttatc 360

agtccaggag ag^ttttgtgc cLgctcggtg tggagccatg aaggl:£f?^g«it cctccaggcc 420

ttutsggggac accaggatgl: gtaccoggtt gtagLrtggeig ctgaaatcca tgctgagct.g 480

taccsggaac t.tgcatatct agagacaQtig actgagtcac Lggcwutct ctttgctcct 540 



gi.^:i>vc«ggc cagaat.aaag aabagagtgt aaaftA<?aaoa aaaaaaaaaa aaeiaarJloa 600
ag 602



<210> 43
<211> 2492
<212> DNA



<213> Homo sapiens



CCfiCSJCCTtCC 
aCt(Tt£%CCCE2 

<jtccaactcc 

gtDcttctcc 

ctgtcctcag 
ccaacatcca 
ttatttgatg 
ttactgggtt 

aaaggctgca 

tata^Gcaag 
^Sttttggag 
ttctttctag 

agtLccatgag 
oaccactgct 
gcactttgc?(^ 
gaccctgocc 
cccagctgcii 
c!«ct.tttgaa 
acgtttttgc 
tgticigcictaa 
tgt^gattct 

fettcggtatg 
tactacaaab 

acctsttgsl: 
agbutoaaat 
aaaaaaaa«^<ii 



g^aggaagga 

actgeaaeaa 
gffwtoscagc 
ggattcctg?^ 
bgratugcca 
attciggagcra 
cfcggctgbgg 
?Tgcttgctcc 

atcatgacbg 
gfiiigccgggt 
CQccccgagt 
gci^hcc-gcbo 
ccttgggtcra 
cgtgacrttbt 
Rcfcggcctga 
gaaga^t?;oo 
gc&t&cagag 
tgg^Sggdtb 
sogcttctcc 
cict^ccgt-gb 
ttttagagga 

gaagaagaca 

ccabccabgfa 
atgbgctctt 
cttcatgcag 

actggctutt. 

accacagaali 
Cflcitoactgc 
ttccacccae 
bgr:iccgtgct 

^••ggttgaaCg 
tgaggaac^^ag 
tctgtb^rcac 
aaac'ttetct 
ackoaaaaaaa 



tgcitgatgaa 

atgggf:gtt.ci 
ffccgcGCtgg 

cgtcatcatg 
gcc cceLhgt. 
M^3gbgttC!?a 
attacggat-i.- 
l-:3g^?ctct!^t 
agctgggotg 
AStbcaisogt 
acccac^agan 
^■fi«7Acaggcc 
ctgttttggi: 
LgccOTccca 
ccagcrtccal: 
gcrcttggceac 
ctaatttgtg 
&gc!?*tbtsgg 
gbtttcttLL- 
tgce^tcttct 
tgtctgggcl: 
QCOactgggc 
t.aaagtcftba 
a^ttccotca 
aabaaglicLfl 
ggbffCCQapc 
ettgt.Aaggc 
5Rgttaca@e 
caa »:L^i*^gtc 
gggcaaacgt. 

CCLggagggn 

gtg&aacatg 
t caafragccn 
btgetgacaca 
gtcccrttgfcn 
bccctt!?tac 
tctictijtcbg 
gag^agaaat 
atnr.igunct 
Qttctctaaa 
^anctoaacaa 



ggaocgtQca 
acctggccgc: 
gagAfacttggt 
atgatggtgl. 
Gf99g^^bcB^cc 
ngccagacaa 

caLcctgi^tt 
ctacatccgg 

tsggtdcctc 
gtt<:rft^<-t.tb. 
gDtoagctcc 
cctggggoco 
ggr-ccatigct 
cccttLcctg 
ttttcraaeag 
actgar.Loc:© 
flcctgajjact 
actt.tgl:l:.tg 

eg^ggccctc 

tccacrtigggr: 
t-gc?abcccag 
aaggc caeca 
tgggc>srtGcc 
ttccateatc? 
5cc?tg-ccttg 
agtcaanMa 
atbtggggtg 
cac1.ggt.ttt 
ccaatttgcc 
ctLcrotcsacg 
togccaggga 
cagatcgc:to 
tsjctagattt 
CC&LifcbtBt 
bttactttae 
a<:raLl.ic;tgot 
tgctgcgaac 
ccccflstbts 
ggtttatggir: 

aa 



caccattcga 
bgwgtttcgt 
tcatggtgac 
yg^tggtcac 
ttcctggcliQ 
gggatggggg 
ggcctcggtcr 
^-l:g»«bagca 
etgttcggcg 
t:l*.oi3tgtwtg 
gtgaacctgc 
LKc>tttfccitg 

f^t.cccc bgcc 
cctcctccgt 
tg^c;tgagat 
atccctcctc 
gtbgtis&gaciR 
gtttggagat 
crcc ttttoct 
bctggattaa 
atgLglitgga 
agaacaaab^ 
ctgg^ijttgg 
oOTatctctt 
gaciiaggsgg 
caaaagcttc 
ft^T^jigggngtg 
ctt^gaaaat 
?^cttggtatb 
Kggcgcttcc 
•iggagaaacic 
aagggcataa 
tgtatgatgc 
tcacttgtat 
gocsctgabcct 
atgaatttgt 
rj^agcattoac 
gj^cbctcaga 
gatt-gttbtc 
aaaaaafs^tft 



<:rrti-<L^cccbcg 
cttatactcc 
gc-.ttgott^^ci 
aatcabtggt 
^.^bgggaQcog 
acat^jgctgrc 
■LcircKibgggc 
COgRgctijat 

ccatgtgcigg 
tgcaatacga 
crI-Ctwbgt.g 
ccaccctcgc 
gLg«9 9gAc«Lt 
t-ctagaaaaa 

bagagaggasj 

gatgcrr.t^gg 
orttCtcttD 

gbatgcat«TST 
catgacatLL 
gggggcacaa 
tctcactagg 
cccccagtto 
cccrgattgtg 
ttcccagagsc 
ggagaga&gc 
gagtacactQ 
gcoat^g<:i::«gg 
agagaactj^a 

gt.ta1:i:gt:gfa 
t^aattaCta 
tiLattg<sattrj 
•cttatcactc 
aaaaatacou 
iTEiaCggj^gca 

ttggacggat 
caaaoaaae^aa 






<210> 44

<211> 75

<212> PRT

<213> Homo sapiens



<220>

<221> SITE
<222> (75)
<223> Xaa equals stop translation

Uet Lft%j Lbu hlct Ala Met Tti> Tttp Ala CyB Val Thr Thr Leu Val 

1 5 10 15

Dhe Tlir Leu val Ser Lyw Leu l?he lie Pru Lou Lys Ser Ser M*si. A!>ij 
20 2S 30 

Gly Glu Mot. Ser Leu Aep Pro Hl^i i5<;r Cys Val Leii Val Cys lie eye 
35 40 45

Ph© Pro Jj©u Are Fhe Val PJn^ Val Sex Cyet PIjr 01 \i Leu Tyx Leu Vfil 
50 55 

Gin £er lie Val X^i% Ijc?ii .9er Glii Gin Lm\ Xaa 
65 70 75



<210> 45
<211> 79
<212> PRT

<213> Homo sapiens
<220>

<221> SITE
<222> (76)

<223> Xaa equals stop translation

Met Af^p Ala L*he Ala Gly Si-ir Pro Plie Ser Leu M^ft Vol Pro Lye Cya 
1 5 10 15

val Leu Xle Leu Phe Cys Leu Val Tyr Ser Leu Gin Cyr> JJ.o <Jln Pro 
20 25 

Tyr S<?r S^tr Leu Leu Aen Scr Ala Ser Leu Pro Tyr His His Gly Lftu 
35 40 45

Lya Leu Ala A^sn L^u Leu Leu rift Val Phe ryr Ptcj Hani II© His Ser 
50 55 GO 

He Pro Phe Sei Ser s^j- His l^o Ser iyfi I-<5su Hie lie xaa 
65 70 75 



<210> 46
<211> 
<212> PRT

<213> Homo sapiens

<220>

<221> SITE
<222> Uli

<223> Xaa equals stop translation






Met Asp Leu Leu Glii V*sn Cys Phe Phe L^!v Phe Phe Ser Hie Leu T^t* 

1 5 10 15

Ser Trj* Tlu ("jiiu Cly Lye L^u Pjtj Cys Asn Phe Pi^o RJ.y Ptd Val Gly 

20 25 30

Arc Val l^^e L.eu Eer Pra Phct Oln Dtet Leu «31y Phe Lya C5ln Xaa 

40 45 



<210> 47
<211> 102
<212> PRT

<220>

<221> SITE
<222> (102)

<400> 47

Met Ala Pile Tzp Ph^ Thr Gly Leu Pto Leu Leu Ser L*u lli- Leu 
J- 5 10 15 

Qy£s Tl^s Cly Arg Val Phe Tjc^h Gly Val Gly Glu Scr Phe Ala Ser TThr 

25 iO 

Gly Ser Thr Leu Tj^p C.ly lie Gly Le\2 V*il Cly Pro Leu HIb T-hx- Ala 
35 40 4.^ 

Arg VAl lie Ser Trp Aen Gly V«l ALa TThr Tyi^ <5**y Al« Met Ala Ala 
50 £5 SO 

GLy Ala Pro Lt^i? Gly Val Tyi Le^:u Aran Gin His Trp Gly Lijy Al<» Gly 
65 70 75 ^0 

Vol Ala Ala Leu rift Veil Leu Ala Val Al.^ Val Ser Leu Trp Leu Ala 
S5 50 gs 

Ser Ala Asn Pro 'ilir Xaa 
100 



<210> 48

<211> 382

<212> PRT

<213> Homo sapiens

<220>

<221> SITE

<222> (67)

<223> Xaa equals any of the naturally occurring L-amino acids
<221> SITE
<222> (139)

<223> Xaa equals any of the naturally occurring L-amino acids

<220>

<223> Xaa equals any of the naturally occurring L-amino acids
<220>

<221> SITE
<222> (165)

<223> Xaa equals any of the naturally occurring L-amino acids
<220>

<221> SITE
<222> (194)

<223> Xaa equals any of the naturally occurring L-amino acids
<220>

<221> SITE
<222> (344)

<223> Xaa equals any of the naturally occurring L-amino acids
<220>

<221> SITE
<222> (361)

<223> Xaa equals any of the naturally occurring L-amino acids

<220>

<221> SITE
<222> (382)

<223> Xaa equals stop translation
<400> 48 

Met Phe C3t.i^ Val Arg Pro Gly Trp Gin Leu Lftu Leu Val Met Ph#t Smt* 
1 i 10 15 

Ser Cyfl Ala Val Aan Gin Leu Leu Val Trp ry*' Pro Ala 'Jlir Ala 

20 2& 30 

Leu Ala Aan Lyb Fxd Ala I'ro Asp Ajxj At^t He Ser G1y His 

*5 

Vttl Qly lie lie Ph* Sor Met Ser lyv Leu Glu Ser Lys Glv Leu Leu 
50 5^ 60 

Ala Thr Xftn Ser Glu Aesp A-rg .S^er Val Are He *rrp Ly^i V5>1 Gly Ajgip 
65 70 75 60 

Leu Axg Val P^x> Gly Gly Arcf Vwl Gin Aen He Oly His Cya Ph(& Gly 
as 90 &S 



Hi A S«j.r ft la Arg vai Trp Gl.n Val Ly:? l^eo* Leu Glu Aetii Tyr Leu He 




100 105 110 

Ser Ala Oly Glu Aep Cyf; Vwl Cy& Leu Val Trp Sot His Glu Gly Clu 
lib 120 125 

lie lisu Gin Aicfi Pbe Arg Oly Hie Din Gly Xiici Olv Kaa A^rg Ala Id^i 
130 i:*5 I4n 

Al« Ala His Glu Arg Gin Til^ Trp Vcul lie T^r Gly dly Asp Ahp Ser 
1^5 150 155 ItiO 

Arg Hie Ajf^g L&u :<49a Hi a Leu Val Gly Ai^ £j£ly OVr Arg cly Leu fiiy 
165 170 175 

Val 5er Ala 1^ Cye Phe L^yv Se*r Auq Sei* Arg P/.u Cly Thx Leu Lye 
iflft lft5 ISO 

Ala Xaa Thr Leu Alta Qly ger Trp Arg Leu T^enm Ala Val Tfer Aap Thar 
195 200 

Gly Ala Leu Tyr Leu 'lyr ABp Val C5:iy Val Lys Cye Trp GHu Cln Len 
210 :J1& 220 

Le-ci <31u Asp Lye Hit: Phi- (Jln Ser 'iVr Cya Lfti3 Leai Glu Ala Ala Pro 

230 235 240 

Gly Pr<i Cilia Cly Phe Gly Leu Cys Ala Met Ala Asn Gly Glv Oly Arc 
■^4S 2?>0 25S 

Val Lya Val Val Pro 11« Aen Thr Pro T>ir Alo Ala Val Asp Gin Thr 
260 2G5 270 

Ley Pho ?ro Gly Lys Vel Hi.& Sex Leu Ser Trp Ala Lev aa?El Oly 'lyr 
27S 2fiQ 2B5 

Glu Glu Leu Leu Tji^u Leu Ala Ser Gly Pjo Qly Gly vai Val Ala cya 
590 3O0 

Leu Gli> llp ."Sfer Ala Als Piro Ser Gly Lye Ala lie Ph^* Val Lye Glu 

310 315 ^20 

Ajtct CyB Arg Tyi* Leu Leu Pro Prxi Sftj- Lys Gin Arg Trp His fPhr Cy;; 

^25 3iO 335 

Ser Alfl Phe Leu Pro P™ <31y Xaa Pbe Leu V^il Gly Asp Arg Ar^i 

^^0 345 350 

Gly ser Vsl Leu Leu Phe Pro SG*r Xaa Pro Gly Leu l.Kn? Tjys Asp Pro 
355 ^60 365 

Oly Val Gly Gly Lyg Ala Ar« Ala Gly Ai*a Gly Ala Leu :saa 

3fl0 



<210> 49
<211> 46
<212> PRT

<2id> 

<221> SITE
<222>

<223> Xaa equals stop translation

<400> 49

Met Gin Ly/j Lyft Lys Leu Val Cyc Pyj.- T.ibu het Leu ADf<f filn Tyr Phe 
1 & 10 15 

Pfee Lp« Vwl VaL Val Sex Leu Pro Prp l^^ro Cy*j V»t Leii Phe Gin Met 
2D 2B .^Q 

Hie Tyr Pro Arg Thjt^ V«L Thr Pro Thr Thr Glu Tyx Xis^s 

95 40 45 



<210> 50

<212> PRT

<213> Homo sapiens

<221> SITE

<223> Xaa equals any of the naturally occurring L-amino acids

<220>

<221> SITE
<222> (64)

<223> Xaa equals any of the naturally occurring L-amino acids
<221> SITE

<222> (132)

<223> Xaa equals any of the naturally occurring L-amino acids
<400> 50

W<sit Vftl Thr The Ala S^ix- S^r Thr Leu Tjrp Jl^a Alci Ala Phe Ser Tyx 
15 10 15 

Met Btet Vii.l Trp Met Val Thx Jlc ILe Gly Tyr Thr Leu Oly He rro 
2D 2h 3D 

Ajgp Val lie Met Gly IJ.^ Thr Phe Ma Ala Gly Thr Ssr V<;tl Pnn 
35 40 45 

Asp Cy^v Mot Ala Ser L.eu lie V«l Ala Arg Glji JCbir Met Gly Aep Xaa 
50 55 60 



Alo Val Ser Aen Ser He «3l.y Ser Afsn V?s3 Phe Asp lie Leu He Gly 
65 70 75 ao 






Lbu Gly L«ii PDfD Ti^p Altt Leu Oln 

Tyi^ Tl.f^ Axxj Leu Asn Ser Ary OTv 
IDD 

Leu AlQ Ser Val Phe V«3 Thr Val 

Gin Leu Aep Xmj9 1^,5 Leu Gly cytif 
130 135 

PliK Ti^^v Cvs Itie sier lie MeL Thr 

Asn Leu Pro Met Cy££ Asp JflXE 

L65 



Thx Lfctu Ale* vaL Aep Tyr OLy Ser 
90 SB 

Leu Xle Ty*^" *?t"iir VaL Oly Leu Lftu 

Pbe fily Val His Leu Asiu Trp 
123 

(31 V CyB Leu Lieu Ticu lyr Gly Val 

Glu The A ATI Val Fhe T-hr PJits V«»l 
155 160 



<210> 51 
<211> 50 
<212> PRT 

<213> Homo sapiens
<220>

<221> SITE
<222> (50)

<223> Xaa equals stop translation
<400> 51

Met l^r- R^sT- Val iTo Leu AIa Thr Phe Bex Vel Trt3\i Thr lie Ala Lieu 
15 ID 15 

Arg Ala Gin Vai l.csii Lye Leu Val Van Leu Ser Phe Vj^T Ser Alo Phe 
20 25 20 

Ser Pi^<> Val HfiB Ty^ Pro Pro Leu Leo Ii^u I^s Gin Ser Aifg Leu 

15 45 

Abu ^aa 
50 



<210> 52

<211> 41

<212> PRT

<213> Homo sapiens
<220>

<221> SITE

<222> (41)

<223> Xaa equals stop translation

<400> 52
Met Leu Cys Asp Leu Ile Leu Leu The Asn Ile Lys Met Ala Ile Tyr

1 5 10 15

HAc; Leu lie He Deu Gin Phe The Cyt ^Tor Val Cya Ser Giu Ptp Asp 

25 30 

'rOir Ala L*u Ser Tic fler Pro LieeM Xgq 
35 40 



<210> 53
<211> 95
<212> PRT

<220>
<222> (95)

<223> Xaa equals stop translation
<400> 53

Met Leu Leju Ser I'he Tyr cys Leu Pro Melj: Vol Ser lie Hift Tie Ph^ 
IS 10 IS 

Ph€t Pro CvB Ala Hi:; Cys- Val Tyr T.ir^u Leu Hie lift Scr Cys Ser Leu 
20 25 i& 

Gly Glu Glv Ser Phe Aan Jljr«j Abp l^ir Cyfj 7*yH Lys Asp P)Mr eye Phe 
35 40 4» 

St^r II© Gin Aen Val Asn Ser Tlir Phip Leu Leu Ser Lku AIq Vol Phe 
50 55 &0 

A^cfif Phft SOr Glu Arg Pbi* Ser Ase> Se^r Asxi Phe Leu Phe Thr TPu* Pzo 
65 70 75 BO 

Pro lie Cys s^i- GLu LyB Aen Qly Leu Leu Ty-i* Hi& Trp Lie Xaas 
B5 90 95 



<210> 54

<211> 485

<212> PRT

<213> Homo sapiens

<220>

<221> SITE
<222> (322)

<223> Xaa equals any of the naturally occurring L-amino acids

<220>

<222>

<223> Xaa equals any of the naturally occurring L-amino acids
<220>

<221> SITE
<222> (374)

<223> Xaa equals any of the naturally occurring L-amino acids

<220>

<221> SITE
<222> (485)

<223> Xaa equals stop translation

<400> 54

Met Val ALa Thj; Vtf i Gly Leu Leu V^i.l. Ph© Leu Ber Leu GJ.y 

15 10 15 

Vrtl Pro I'ro Val Arg Cys L&u Ph^ A La J^eu Ser Vwl Pro Thr Leu <^ly 

25 3D 

Met <31u Gin Gly Ar^g Ai g Lou Leu Leu Sejf T^r Ser fJlir Alas Thj:- ln^vi 
145 

Ala lie Airt Vnl Vol Pro Aen Val Lau Ala Asn Val Giy Aln Ala Oly 
50 M 60 

Oln Vnl Leu Arg CyB Val T>jr 01 u Gly E3er Leu GLu Ser Leu Leu Aen 
65 70 75 80 

T*hr Thr Has Cln Lto HiB Ala Ala Ser- Arg Ala Leu Gly V^x^o Thr Gly 

R5 SO 95 

Gin Ala Gly Eer Aif^ dly Leu ^lir Phe Glu Al<9 Cln Asp Aan Gly Ser 
lOD 305 110 

Ala Plic TVr Leu Kie Met Leu T>n- Val Thr Gin Gin M^l T^u Giu Abp 
ilB 120 125 

Ph© aer Gly Leu Glv S<?x Lbu Ala Arg AIa AIq Ala Leu Gly Thx Gin 
1.30 IS:* 140 

Arg V4iJ Val Ttir Gly Leu Ph^ M^^t- Leu Gly Leu Ley Vnl Glu Ser Ala 
"5 150 15-5 T6n 

Trp 'ivr Leu Hifs Cy^ Tyx Leu TOir Aep Lm> Ars l^he AeE> Aen Ilrt Tya- 
165 170 175 

Ala ThT- Oln Gin Leu OOir Gl.ji ftrg Leu Ala Gin Ala CSin Ala Thr Mis 
IHO 1B5 190 

Leu Leu Aija Pro Pro t>ro Thr Ti^ Lev Le^u Gin Ala Ala C^iri Leu Ars 
155 200 205 

Tivu SoDT Gin Glu Glu Leu ].i3u Ser Cye Leu Lw Ara Leu Gly Leu Leu 
210 215 220 

Ala Leu Leu Lc^u Val Ala Thr aIa Val Ala Val Ala rhi- Asp Jiis Val 
225 230 7.A0 
Ala Phe Leu Gln Ala O^i^ Val Asp Trp Ala Glfi ljy:> Leu Pro

245 250 255 

T-hi V^l Pro lie l^r Leu TJii- Val Lvs Tyjf Afcp VeL Ala Tyx Th^ Vfil 
260 26^^ 2VU 

Leu <3ly Phe iLe Piy> Phe Leu Phe Asn Gin Leu Ala Pro Giu S*9r Pro 
275 260 285 

Ph« Lpv Sor Veil HiB Sejf s<?r Tyr Gin Trp^ Glu Ifcu Arc Leu H-iic ser 
2i?D 255 30O 

Ala Ars3 cys Pro Leu Ltm Pro Ala Arg Arg Pro Aig Ala aIa KIt^ Pro 
30fi 310 31Fi 320 

Leu Xaa Ala t^Hv C^l-y Leu Gin Lru Lou Ala Gly Sti-r Thr Val Leu Leu 

32S i30 335 

Cllu Gly Tyr Ala Arg Ar^ Leu Arg iCaa AIa He Ala Ala Ser Phc Phe 
0^40 345 350 

rhr Ala Gin r3l3u Ala Ars? Ar© Ilt^ Arg BIb Leu Hia? Ala Area Leu Gin 
355 3&0 365 

Arg Ar^j dia Asp As^g Xan. Gin Gly Gin Gl.xi Leu l^ro Leu Gly Ar?£j Prw 

370 y?^ 

Ser Cyis VpI Pro Thr Pro Ajhj Pxo Ali» Cya Lyft Pttj* Pro Ala "J'rp He 
385 350 *00 

Alis 'JVr Arg Leu A-yp Ala Leu Arg •Thr Olu Ser Ser Glu Gly GJi» Gly 
405 

Lye Glu T.ir#w Trp Ser CyE At^ Asp Leu iier cy.'j Hj.& Leu Giv Pro Val 
420 425 430 

Pro Pxo Pro Cye Val Thr Leu Sly Lye Leu «ifl Leu Ser filv Pro 

435 ^40 445 

Arg Phe His Leu Hie Aczi A&c> Ser iLe Phis Tbr lie Asp VaL O'hr 

450 455 460 

Tyr Phe Pro Aat^g Axg Asp Val val Ai-g Met Glu Gly Ar>7i Thr Gly His 
ifiS 470 475 

Aep Airg Pro Gly Xea 



4B5 



<210> 55
<211> 115
<212> PRT



<213> Homo sapiens
<220>
<221> SITE
<222> (115)

<223> Xaa equals stop translation
<400> 55

Met l>ro lie His Lys Tht^ fc/.g lie Ser Cy-. v^l l^e Leu Leu L^u J5<L:sr 

Leu Lyc rr^p His Trp Met *hr Asci <^ly Lys Leu At^ii AIr Alci Leu Aan 
20 25 30 

Viil Pino Lw Giy Fhe Arg Gly pfae Gin Sea^ Gin Txp l^ir Gly Gly Giy 
is 40 

Leu Cy« <5Ln CyB L^u S*%r Cly Val cye L^^w Cys Hie Cyf; Qly Ala aId 

" 60 

Trp Al6 Till- Arap Leu Oly ADf^ -Phr Leu Gly Aep <3ly Ala Pro Val Trp 
65 70 75 SO 

■TxxJ Val cya Val Gly St^r Ala Val Pro Vul His Val Arg Irys Ala Leu 
85 90 35 

L^u Lsiu TVr Ihr Glu Ser Cyjs Sear Leu Ser Thu Thr Abi3 Arg Ser Prv:> 
iOO los 

Leu l^DTD xa.a 



<210> 56

<211> 50

<212> PRT

<213> Homo sapiens

<220>

<221> SITE
<222> (50)

<223> Xaa equals stop translation

Mel- 8ot Arc Ala Pro Cyi^ Ala Ser sejr IIk t.^^u Val Leu Thr Lau Tl.ts 
^ 5 ac> 15 

Val rhr Lftu Val Leu Leu Cyjs Sex Val Lye ll© Cys Aen Trgih Leu 

20 2b 30 

Ars lie Tlir Val Gly Val His Ser Tyi" 6er l^r Lyr- Pro Gin Val 

35 40 45 

Phe Xaa 
50 
<212> PRT
<220>

<221> SITE
<222> (172)

<223> Xaa equals stop translation
<400> 57

Hat LyE LyE Cya Lfcu I.<3tu Pro Val Leu T3.i^ Thr Cys Mfet Giu Thr Ala 
15 LO 15 

lie CyA lf/:j Asp Arg Met Met M*-t. lie MeC lift l^tiu Leu Val Aen Tyx 
20 25 50 

Arg Dro Asp Glu Phe 31^? Clu Cys Glu At^p Pro Val AEp Hi:^ Vol Gly 

40 45 

ABn Alft T>ir Ala Ser Gin Glu Low Gly *J*yr Gly Cys Lou LyB Phe Gly 
50 55 60 

Gly GLn Ala Tyr Sej- jEisp Vnl Glu tlte Tin- Ser Val Gin Cyi; Hi.e Ala 
£B 70 75 BQ 

Leu A5jp GJy Tie Glu Cys Alft Svt Pro Arg Tlir PU- beu Area <Slu Asn 
fih 90 9.5 

Lys Pto Cye lie Lye Tyi- Thr Gly Kia Tyr P>io lie "Jlir O^r Li^u Lftu 
lOU 105 110 

lyr Ser Phe Phis Lou Oly Cye? Phe (31y Val Aap Arg Pha cVs LpSU Gly 
115 120 L2B 

His Thr Gly Tlir Alei VrI Gly Lye Leu Leu Thr Leu Gly Rly J^k Gly 

1:^5 140 

lie Trp Trp Phe Val Aefj Leu lie Leu Leu Ilo Thr Glv <jly Leu Wel. 
145 150 155 160 

Pro Ser Asp Gly Sot Asti Trp Cyf. T>ir Val 'Tyr Xafi 
165 170 



<210> 58

<211> 125

<212> PRT

<213> Homo sapiens 

<220> 

<221> SITE 

<222> (IPI) 

<223> Xaa equals any of the naturally occurring L-amino acids
<400> 58

MeC Leu Sei^ Gin Pro Axa Met Glu Si^r Lew Asp T^ir Pro Alw ai.o. Tyr 
15 10 15 

5r!j- T.OIU Cly Leu Ala Leu Ii&u (Jly Leu C51y Val v^il T.cu Val Leu Ser 
2D 2S 3D 

Ser Phe Phe Al& Leu <5ly Phe Ala Gly Thr Vh^ Leu Gly Aep Tyy Plie 
3S iQ 45 

t3.Ly Xl.« Leu lib^s Glu Ala Ar>9 Visl Thr Val Pbe PrY> Mic Asn He Leu 
50 55 GO 

ABp Asn Pro Met 1^'- Gly Sex Thr Alu Asn Tyi Leu Gly Trp 

70 7S ao 

rife Mfti. Hnrp Ain £j©r e^ro Thr Gly Leu Leu Leu Thr- v?^l Leu Val Ala 
as dCf 

Leu Thr 'lyr rle Aii» Leu Leu Tyr Glu Olu Pro Phe Thi: Al^ 

100 n05 110 

He Tyr AjrQ 05x1 Lye Ala Ser Gly Srr Bis Lye Arg Bur 
lis 130 135 



<210> 59
<211> 311

<212> PRT

<213> Homo sapiens

<221> SITE

<223> Xaa equals any of the naturally occurring L-amino acids

<220>

<221> SITE

<222> (311)

<223> Xaa equals stop translation

<400> 59

btet Leu i.K5fu Trx) Leu Lfeu Gly ^rp Leu Glu Cyts Vtil His Asn £jer Arg 
1 5 1.0 Iti 

Ar^ Ser Gin Gly L«v Pro Pro His Tyx- As;p Aep Val Glu Val Pbc lies 
20 25 30 

Leu Gill L«u Cilu Gly Glu T.ys His 'J'rp Arg Le^j lyr Mis Pro Thr Val 
35 ^0 dt> 

Pro Leu a1j9 Arg C'lu 'lyr Ser VaJ Glu Ala Glu Glu Ai^g II© Gly Arg 

5S GO 

Prci V?il His Glu Phe Mel: Lwk Lys Pro Gly A-?p Leu Leu Tyr P^c; Pru 
65 70 75 80

Ara Gly *hr lift C!ln Ala Aep THr Pro Ala Gly L<©ii Ala His Ser 

8S do ?5 

Thj^ H^!^ Val TJir lie Eei* TIit- Tyr GLn am A&n Sei Trp Gly RjEsp Ph^t 
lOO 10? 110 

Lau Leu Aep Thr III;? Ser Oly Leu VmI P}ie Aap Thi: Aln l^ys Glu A&p 
13 S 120 12S 

Val Gill Lev Arcr 'l^r Gly I'Jr. Pro Arg Gin Tj(?u beu Leu Xati Vol Glu 
13D 135 1*0 

Ser Tlir Thr Val A].rt Tlir Arj? Sex Gly Ph*^ I^etu Axg Th.r Ijeu 

Ids 150 IGO 

Ala Asp Ai-g Fir-y Glu Cly Thr Jiys Glxj Leu Iftr.xt Sei Sex Aap Mot Lys 
165 no 175 

I-y:; Asp Phe lie Met Hi s Ar^ Leu Pyo Pro Tyr Eer AIa Gly As]o Gly 
IBO 165 150 

Ala Glu Leu Sftr Thr l*ro Gly Gly Lys Leu Px^i> Arg L©u AEPp Eer VaI 
1S5 20D 20S 

Veil Arg Leu Gin Phe TyB Asp His He Val Leu Thr V^*:! Pro Asp 

210 215 220 

Gin Afifp Gin Ser Aap Glu Al(* Gin Glu "Lys Web Val Tyr He Tyj; 
225 230 235 240 

Ber Lbu Lys Aet) S^dx Axct Glu Thr Uast klet Net Gly Asn Olu Glu Glu 
2d5 2150 :?55 

Thx Glu PJuT His Gly Leu Adcxj Phe Pxo Leu Scr His Leu Aep Ai<a Leu 
260 265 270 

i-ys Gin iLe Txp A&q Ser Pro Ala Ho 3©r Val Ly?^ Asp Leu Lys Leu 
275 380 2S5 

Thr ThiT Asp Glu Glu Ly« Glu Ser Leu Va3 Leu 5er Leu Tip Thr Glu 
2SD 255 .iOO 

Cys Leu He Glu Vol Val Xaa 
305 330 



<210> 60 

<211> 164

<212> PRT

<213> Homo sapiens



<220>

<221> SITE
<222> (2)

<223> Xaa equals any of the naturally occurring L-amino acids
<220>

<221> SITE
<222> (164)

<223> Xaa equals stop translation
<400> 60

wet xaa Gly Lftu 1.i5t> Lbu Ala Ala Ph*^ Ala Leu Val S«t; Pxo 

:» 5 10 15 

Axxj Al^ olr> Ala Val Trp Lau Gly Jsong i>eu Asp Pro CUxi Gin Leu Leu 
20 2& ia 

<51y I'ro Trp Tyr Val Lkfmj Ala Val Ala Sei- Ars tSlu Lya Gly Pho Alo 
S5 iO 45 

Met Glu Lyrt Ajl;p Met Lye AEn Val Val Gly Val Vm.i Val Thr Leu T*hr 
50 55 60 

Pro Glu Asn Asn Leu Arg Thx Leu Ser Sei:^ OLn Mis Gly Leu Giy aiy 
6S 70 75 

Cyep Aep Qlln s?:»jr VaL Met Aesp L<nJ T3© Uys Arg Afsn ficr Cly 'i:rp Val 
85 90 

Fh© Gl\i Asn Pro Ser ll^i Cly Val Leu Glu hovi Trp Val Leu Ala Tlir 
100 105 

Asn Phft /irg Asp 'fyr Ala lie Ph« 'llir Gin Lftu oiv Ph© Gly asp 

lis 12D 125 

Glia Pro Phe Aan Thr Vi?! Glu Leu Tyar Sex- I.esu llir Glu a*hr Ala Se-i^ 
l^tJ 13S 

Gin Glu Alia Mfst Gly Leii Phe Tin- Lys l"rp Ser Aiy ^;ctr Leu Gly Phe 
=145 

Li;;^u Ser Gin JCaa 



<210> 61 

<211> 240

<212> PRT

<213> Homo sapiens

<220>

<221> SITE
<222> (240)

<223> Xaa equals stop translation

Met Ai>9 AlA Lou Arg Ajrg Leu tl© Gin Gly Ary Zlt^ Leu Leu Lau Thir 
1 5 10 15

lie c:vB Ala Ala Giy Tie Gly Gly Thx Phi? Gin l^he Gly Tyir Aan Tjki± 
2G 25 30 

Sfej:^ X3 i;i Tip Asn Ala Pro Thx.- "Le^u Kio J:Le Glii Rlij E*hf3 Thir Asn Glu 
^5 dO 

Thr Trp Gin Ala Arg T»jr Gly C5l\j Pro Leu Pt-q Abp HIb Leu Val Leu 
50 5& 

iieu Met Trp £3fti- Tiou ILe Val Ser L^pv Tyx Pro Leu Giy Rly Tjpu Phe 
6.5 70 71? ftO 

aly Al.^s l>cu Leia Ala Gly ^to D??u AIg lie Thx^ 3.cii Oly Axg Lye Lys 
ft.^ 90 95 

£f©r Leu Leu Val A«ci Asn lie Phe val Val Ser Ala Alii Xl.ci T^coi Phe 
100 105 110 

Gly P>i<ii Bbt Are Lys Alfi C31y Ser Phe Glu M^?t. II© Met Leu Gly Arg 
11£ 120 125 

Leu Leu val fily Val Asn Ala Gly V*!il Ser Met Aen Tie Oln Pro Met 
J.30 135 140 

Vyf iKAJi Gly C5lu Ser Ali^ Pro Lys Glu Leu Aig Oly Ali* Val Ala Met 
145 L50 155 1.60 

Sex Ser Ala lift PH*? Thr Ala Leu Gly Tie Val Jttet Gly Gin Vtil V!>> 
165 170 175 

Gly L^ S^r Thr Thr Ala Alw Pro Gly Leu Arxj OLy Leu Gly Arw Gly 
IfiO la^i 190 

Alu Gly Gly Al/^ Oly GLy Gly Alfi Arg Cys Leu Pro Giy LOfu Pro Qys 
1S5 20O 205 

Px'O Alo Pro Met Gly A La Val Pro Ala S?ex- Oly Pro Glu Glu Thr Gly 
210 21!i 220 

Asp Lys P>;<? Arg Gly Bex Gly Gin Cys His Gly A.lu Leu Arg C!lu &aa 

7.30 235 



<210> 62 

<211> 130

<212> PRT

<213> Homo sapiens



<220>

<221> SITE
<222> (130)

<223> Xaa equals stop translation
<400> 62

Met Glu Arj3 Trp VaL Asp Aap Ala phe Trp Sor Phe i^eu Phe Ser Leu 
J- 5 10 15 

He Leu lift vaJ. Tlo M©t Phe Leu Tx-ji Arg Pro Ser Mia A&n Asn Gin 
20 25 30 

Aro Ala Dhe Met Pro L^u xi© Asp he^ Sftj- A.gp Aep Glu lie Glu 

3S 40 4b 

Glu Phe Met Val TVir S©r Glu Aeih Leu Thr Glu Gly Ilrt T-ys Leu Arsj 
50 .S5 GO 

Alti ^ToT Tys fi^r VaI Ser Aen aiy Thr Ala Lyss Pro Ala rhzr Seir Glu 
es 70 75 BO 

ABn Phe Aep Glu Aep l.tju I^a Trp Val (Jlu Glu Aen lift Pro Sex Sex 

8J> [JO 55 

Phe Thr a^p Vol Ala Leu Pro v^l Leu Val Aep 5cr Asp Glu Glu He 
100 105 IH) 

Met Thi Arg Sejf Glu w^t Ala Glu L^^: Met Phe ier Ser Glu l^sis 11<^ 
T15 17.0 1,25 

130 



<210> 63

<211> 61

<212> PRT

<213> Homo sapiens

<220>

<221> SITE
<222> (61)

<223> Xaa equals stop translation

<400> 63

net Phe Glu Cy& Val He Leu V&l J5or Phe Leu Val Vwl Fhe Val val 
^ ^ iO IS 

Val Aif«g Cy& Val Gly Leu Jl^ pro Thr Gly Gin flTer Lye Glu Ph« ^in 
^0 25 30 

His Pr^ Lev Pro Ala Cye Ser Qy:s Tyr Pro Thr Asp Ciln llir Leu Aen 
35 45 

Ser Ser TrL> Gly Cy« Cys Leu Ala Pi.o Kis Kis Asp Xa«) 

i^!^ 6n 
<212> PRT

<213> Homo sapiens

<220> 

<221> SITE 
^222> <9S) 

<400> 64

Met Lieu lieu JSer lie GJ.y Meb Leu Wet L^u Scir- Ala Thr (iln Val Tyr 
15 10 15 

Thr rift litsij Thr Vnl Oln Leu Phe Al.va Ph© Lieu Aan L^tij Ti©u rxo Val 
20 25 30 

Gill Ala A&p lie Leu Ala Tyj- An^n Phe Giu tu€^ Al^ Ser GLn Tluf Phe 
35 dO 45 

Aflp Aap Leu Prfi Alw Ars I'he Gly Tyr At^ Leu t^ro. Ala Glii <31y Leu 
50 S-S (to 

Lvs <3Ly Ph« Leu lie Aan Sesr Tjiys Pro Glu Asn Alii Oys Olu Pro He 
65 70 75 «0 

Val Pro P^> Pro Vol Lys Aep Aen Sftr S^jr Gly HiB Phe His Are Ve*i 
B5 SO S5 



<210> 65
<211> 54
<212> PRT

<220>

<221> SITE
<222> (54)

<223> Xaa equals stop translation
<400> 65

Met Ala AAiA Leu Leu Jljeu Als Gly He DyB Jle Lfeu fi^v Asn Gly VdI 
J 5 in lb 

Pro GLn Aep Gin £f©r He Val Arg T?h-r- S©r Leu Ala Val Le.u Gly 
20 25 30 

lays Gly Cys Leu Ala Ala Set- Ph© Aen Cys lie Phr; Le«u Tyr a*hr Gly 
35 



Asn Cys lie Pr** <33.ti Xop 
<210> 66
<211> 64
<212> PRT

<213> Homo sapiens
<220>

<221> SITE

<222> (64)

<223> Xaa equals stop translation

MftK His Ttp ABn Leu PjfO ODr* Val Asn L^2U Php Ala i.eu Leu Lfifu T-,w 

ThT He LetJ "PhDC Leu Pro Hie Lau VrL Val Pro Tyz- His «is Arsj 

20 25 30 

His Tin: Gin *J.a Oln <51n Aan Ajjji Arc Glu Pro Tyx Leu <51n Asn Cyc 
35 40 ifi 

Gin Alia Kis Kia ieii His Gin Leu Pro t-he His Air^ AsiE> Gin Xaa 



<210> 67
<211> 107

<213> Homo sapiens
<220>

<221> SITE
<222> (107)

<223> Xaa equals stop translation
<400> 67

M^t Phe Cya Phe Tyr Leu Asn Tyr ?h© Vhr Aern Lftu Ph^ Lou Phe Leu 

Thr Cys ;5er Ars^ Ser Glv S©r Leu sei:^ 5<ssr iTo Gly Prrt Tyr S©r 

^0 25 30 

Gly Phe Pro Ph© Leu Lya Ser pro Pro Val /krs Asxi £Ser Leu Aan Lya 
35 do 

C5iy Pro Leu Leu Val Qln TVr Tyr Bvt- Ph^t Ser S^r lll^ Tci« Ath Val 

5S 60 

Pro Artr LyB Lys Lys Gin Val Zi© Arg Val Pro v«l Ar^j vol Pro Pj^o 

7(> 75 
Lys Ser Pro Ala Met Ser P^i> Pro Ser Ser Pro At cj Phe His Phe Phe
85 SO <*5 

Tl^r Phe Set- Giy Pt-o Ph^ Pro Asn Set^ Tyr- Xc>a 

100 105 



<210> 68

<211> 45

<212> PRT

<220>

<222> (45)

<223> Xaa equals stop translation
<400> 68

Met Arg l>yf; TJjr A La Ttrp l^eu Cya Pli<; Phe Phe Gin Iji?s« CJys <31y Leu 
15 10 1.S 

Cly Gin Vsl Thr Ser L^ii Qln Tirr Arg Asn CyL> Asn Val Glu lie Uyr. 
70 25 30 

Pro Sei^ Lfcv V^^]. Arg <51v Thr Hie Ar.g Sex lie Ptc? X^a 
35 40 45 



<210> 69 

<211> 43 

<212> PRT 

<213> Homo sapiens

<220>

<221> SITE
<222> (43)

<223> Xaa equals stop translation

<400> 69

Met Aeh Leu Deu Leu T.w Vnl Ser Thr Tjrp Met )flet JJeu lie Gin Glu 
15 10 15 

Giy aei; Cys Phe His Met rPh^ r.n^y. Heb Aan Glu Ltiu Lys Ara Oys 
20 25 30 

yyr Trp Ser Tyx- P)j^ Val Arg ser Hii= T l€i Jiaa 
35 40



<210> 70

<212> PRT 
<213> Homo sapiens
<220>

<221> SITE
<222> (56)

<223> Xaa equals stop translation
<400> 70

W^t Pro CVb 'irhx c:yB rhr Trp Arg Aan Txi? Axg t;ln Trp ll^ Arx, Pro 
1 5 10 15

L&u val Ala Val lie Tyr 1*12 val Spl- He Vel V^ii Ain Val I'ro Leu 
20 25 30

Cy« VftI, Ti-p Glu Leu Gin Lvl; l^ cilu Val <5.1.y He Hlb Tlir Ly^^ 
35 40 45

Ttp Phe lie Ala Gly .Tl<3 i>he Leu l>su Xan 
50 



<210> 71 

<211> 45

<212> PRT

<213> Homo sapiens



<220>

<222> (45)

<223> Xaa equals stop translation



Met L.ys Set- Wir; Alia Thr Lftu Thr Gly C5ly Eer Giv pbe Tyx Phe lie
1 5 10 15

CJlv Leu Ser Phe L^^? Leu lieu Arg S«ir V«il Leu Leu Val ittsv Vnl Leu
20 25 30

Leu Tjfp Gin Phe Pxo Ly» jS^er Leu Thy «3ly Oln Glu Xaa
35 40



<210> 72
<211> 71
<212> PRT

<213> Homo sapiens



<220> 
<221> SITE

<222> (43)

<223> Xaa equals any of the naturally occurring L-amino acids
<220> 

<221> SITE 
<222> (44)

<223> Xaa equals any of the naturally occurring L-amino acids
<220>

<221> SITE
<222> (49)

<223> Xaa equals any of the naturally occurring L-amino acids

<220> 

<221> SITE
<222> (52)

<223> Xaa equals any of the naturally occurring L-amino acids

<221> SITE

<223> Xaa equals any of the naturally occurring L-amino acids
<220>

<221> SITE
<222> (71)

<223> Xaa equals stop translation
<400> 72
<400> 72 

Met Gly tie Phe Ser- Thr Leii Leu Lk^u Ala £fer Aesc^ i^iji- Leu Leu Asn 
1 5 10 15

Leu I Li? I>cta Phe l?he Phe 1 1*5 Phe Ala Phe S«sr Val dya Leu 13ft j:^ fie-j- 
20 25 30

Sei Sex Phe Pro Sftr Oys Cya VaL Seji^ Vol Kaa Xaa Li^u S©r Vnl lie 
35 40 45

Xafi aiv Sex Koa iJer Ser H.i.a^ Xaa Ala Thr Cy& Ala His Thr Ser Le^u 
50 55 60

SSer Gly Thx Pro Val Met Xaa 
65 70



<210> 73

<211> 44

<212> PRT

<213> Homo sapiens 

<220>

<221> SITE
<222> (44)

<223> Xaa equals stop translation
<400> 73

Mer. MiLit Sex Pro Ser C?ly Ile« lie val Tyr Val Ser Alt* Thr Pro His 
1 5 10 15

lie Leu Lev Qys lie Leu lie Thr ^P-he Net ht^v Ala Lie Pro ^<»r lie 
20 25 30 



His Aen Gly Arg V«l Cy^ V^il Leu phe lie Phe Xaa 
<212> PRT
<221> SITE

<223> Xaa equals stop translation

Mi7t. Hi .5 VaL fiifl Cya Phe Jil^? He His Val l^mj Pht^ His fe^he cy« s^i' 

1 5 10 15

Thr lie Ser Ala A^t* Alo Leu Ser Phft Qys lie Ph^ Cys TVr oiy pro 
20 25 30 

Gin Thr Lftu He Asp >aet Ty^ Tti> ftsn £fer XaB 
35 40



<210> 75

<211> 178

<212> PRT

<213> Homo sapiens

<220> 

<222> (67) 

<223> Xaa equals any of the naturally occurring L-amino acids

<221> SITE
<222> (178)

<223> Xaa equals stop translation
<400> 75 

Kl^C Phii Gin Val ArQ Pro Gly Trp GLn Leu Leu htjv V©1 Met Phe Sex 

Sex Cya Ala Val Soz Asn Gin Leu Lftu Val Trp Tyr Pro Ai^j Thx Alci 
20 25 30 

Leta Alfl Aj9p ABTi Lys P^t> v«il Alet Pro Aap A-rg Arc He Eer Giy Hd-. 
^5 AO d!> 

Val Gly lift TL<E? Phe i;er Met Ser Tyr Leu «Lu Ser Oly Leu Leu 

^0 55 GO 

Aid Thr Xaa Ser Glu A;5p Axa Ser Val Arg 'I'rp Lys VaJ. c^ly Abp 

70 75 ' fiO 
Leii At^e V^n Pro C5ly Gly Ar^ VeL

HIb Ser Ma Ajrg Val Trp Gin Val 
100 

Ser ALa fiiy GJ.ii Asp Cvb val CTyr^i 

11!5 L20 

Tl-c; L^u <31n ALo Phe G^y His 

130 135 

Gly Ala Glu He Hj 15 Alei GLu Leu 
1^^ ISO 

Thr Qly Thr Cl\i Ser Leu Alrt His 
Glu Xaa 



Gin Am J'Uz 01 y Rie Oye Phe Gly 
90 95 

Lyn Tjofu Lbu Glu Aen Tyr Lttu T3rjs 
105 110 

Leu Val Tjrp 5r^T Hi a Olu Olv <Slu 
125 

Gin Aep Vol Tyr Fro Val Val Val 

TVi- <31n <alu Leu AL« Tyr Leui Glu 
l«i5 160 

Leu Phe AXa Leu Val Fro A2^ Pro 
no 175 



<210> 76
<211> 49
<212> PRT 

<220>

<221> SITE
<222> (49)

<223> Xaa equals stop translation
<400> 76 

Ket VaL ThT Ph© Ala Sex S<?ir Thr Leu Trp Ho Ala Ala Tht iSer Tyr 
15 10 15^ 

Met »et val Trp Mp?t. Vol l^hx lie III;^ Gly 'lyr rrtii^ Leu OJy 11^: Pro 
20 2!> 30 

AEp Va.1 He Met Gly Aet> Hi a Leu Pro Gly C^ys '.Trp Asp Gin Ar^ Ala 
35 40 4.5 



<210> 77 
<211> 14
<212> PRT 

<213> Homo sapiens



<400> 77

Abu Tyif Phe Pr<> Vol his l*hr Gin l*ro Asn. Tri> Tyx Val 

IS 10 



<210> 78
<211> 31
<212> PRT

<213> Homo sapiens

l"ro Val Pfae Thr vai Ayn Phe l^eu Ala T-jfp Val HiB Ala Pj.-y pro Val
1 5 10 15

Ser lie Thr Val Asfp Leu iLe: Pjtd Tbr Leu Ala <3Jti Ala Txp aer
20 25 30



<210> 79 

<212> PRT 

<213> Homo sapiens

<220>
<221> SITE
<222> (13)

<223> Xaa equals any of the naturally occurring L-amino acids
<400> 79

Trr^ He Oln Arg He Ancj TJir £fer Ala Asp Gin Leu Gly Pro Lyr^ Lyr=; 
1 5 10 15 

val val Xaa Plie- t^ly Leu Ala Cye Cy^j <31y val Sear flly b^u Phe iVr 
20 25 30 

Ala 



<211> 351
<212> PRT
<213> Homo sapiens

<220> 
<221> SITE
<222> (78)

<223> Xaa equals any of the naturally occurring L-amino acids

<221> SITE
<222> (326)

<223> Xaa equals any of the naturally occurring L-amino acids
<400> 80

P:ro Pro C5ly Cyn* Ala A La He Pit> Lb« Gin iTbr Afg Sex Ala Gin 

Gly Pxo Trp G3y Gly Arg Gin Gl.y 5er Gly Trp Cy« Trp Gly TfjT- V«7 
20



Vnl Gly Ser <31y Sei^ S<tr Cly Gly C5ly Aesii A) a Ph© "Jlir Gly Leu <3iy 

4(> 

l^ro Val Eer Thr- (n^u Pro Sex Leu Ha:* Oly J-ys Gin Gly V?iL Thr Ser 

!>0 55 

vai TliT- OyB His Gly Gly Tyr Vnl IVr "Thr Tlir OLy Ar^i Xa.a Gly Alss 
65 70 75 80

^yr 'ivr Gin Lesoi Pliici Val Arg Aep Gly Oln Leu Gin Ptvi Va'l Leu Ars3 
85 90 95

Glii ljyr> ^^t: Cys Arjs Gly Mwt. Asn Tr^^ Leu Aln Cly Leu Arg lie Val 
100 105 110

Pro Asp Gly sej: M&l- Val He Leu Gly Phe Bis Ala Aku C^lti Phe V£»l 
115 120 125

Val Trp A2fftTi Pi?o Ar^ Ser nit (?3u Lys Leu Hit JJ.e Val Asn Cya Gly 
130 135 140 

Gly Gly Kifl Arg Sei^ Trp Ala Phe Ser A&p Thr Glu Ala JVln^ «et AIq 
145 150 155 160

Phe Ala Tyir Lol> l^ys Asp Gly A^p Val Met Leu Tyr- Ang Aln Leu Gly 
165 170 175

Oly CSys Thr Arg Pro K-is VaL lie Leu Aig Clu Gly Leu Hie Gly Axxf 
180 185 190

Glu lie TJli- CyB Val Lyes Arg Val Gly Tlrr lift Tbr Leu Gly Pro Glu 
195 200 205 

Tyx <31y Val Pr^ B^sr Phe Met Gin Pi-o Asi> Aap Leu Glu Pro <31y Ser 
210 215 220

Glu Gly Pr^ Asp Leu Thr A&p He Val He Thr Cys iSer Glu Atsg* Thir 
225 230 235 240

Thr Val cyB V&L I.gu Ala Leu P^i> Thr 'llir OThr Gly iSc-r A.l« His Ala 
245 250 255

Leu Thr Al?% Val Cys Aen Ri& He sex Ser Vwl Arg Ala Val Ala Vai 
260 265 270 

Trp Gly lie Gly Thr Pro Gly Gly Prc> Gin Asp Prn Gin Pro Gly Leu 

260 2S5 

Thr Aid His val Val ^yr Ala Gly Gly Arg Ala Glu Met Kj a Cys Phe 
290 295 300 



ser He Met V«l Thr Pro Aep fro Ser Thr Pro S<?r Are Leu Ala CJyi- 
305 310 315 320
Hl& Veil iSet His Leu 3Caa S«fr Hl& Atr Leu Oly Tyr Txp Aep Arg

323 3:3<? 

Gin Arg Abi^ Arg Hi a Arg Met Val Dye Val Asp liTo Glu rrhr Ai>g 
340 345 350



<210> 81

<211> 38

<212> PRT

<213> Homo sapiens

Pro Pro Gly Leu eyes An» Ale lie Pro L«sij GLn llir Ar§ Ser A.l» Gin 
1 5 10 15

Gly Pico Trp <l]y Gly Arc Gli^ Gly Sor Gly Trp Cyr; Trp Gly I'hr Val 
20 25 30

V*l Gly Sar <51y Ser Ser 
35



<210> 82

<211> 40

<212> PRT 

<213> l^ocnD s«Lpl«ine 

<220> 

<221> SITE
<222> (40)

<223> Xaa equals any of the naturally occurring L-amino acids
<400> 82

Gly Gly Gly Asn Ala I'he Thr Gly buia CiLy Fro Val Sr^^r Thr Leu E^ro 
IS ill 15 

Ssr Leu Rlb Gly Lye Gin Gly Val Thr Sei^ V«l l^r cye Hie Gly Gly 
20 25 30 

Tyr Val Tyi Thr Thr C5ly Arg Xa^^ 
3& 40 



<210> 83
<211> 40
<212> PRT

<400> 83

Gly A?.ci Tyr lyr Gin Leu Phis Val Axg Aap Gly Gin Leu Gin Prv^ vaJl 
15 10 15 

Leu Arg Gin Lye Siar Cys Ajt^t Gly M<?t Aen 'Jrp Leu Al& <^ly Leu Arg 
20 25 30



lie val Pro A&p GJy Ser Mat Val 
35 40



<210> 84
<211> 41

<213> Homo sapiens

lie Leu Gly Phe E3ie aIh Asn Glu The Val Vpi Trp Asn Prt> Arg ser 
15 10 15 

His Glu Ly« LAij Hi.s 11© Val ABn Cy^ Qly Gly Gly Hi£i Ar^ gar Trn> 
20 25 30 

Aitf^ Pfcac S^r Asp U'hr Glu Ala A.1pl Mat. 
35 40 



<210> 85

<211> 42 

<212> PRT 

<213> Homo sapiens

<400> 85

Aid Pha kla lyr Leu Lys; Aap Gly Aap Val MeK l.oii Tyar Arsr Ala Leu 
1 5 10 15

Gly GLy Cyfs Thr Ax^ i*ro His val I.l.c- Leu Ara Glu Gly Leu Hi« Gly 
20 25 30 

Airg G3.V I La Thr Cye Val Lyrfi Arg VrI CSly 
35 40



<211> 43

<212> PRT

<400> 86

Thr lie Thr Leu Gly Pro Glu 'Pyr Gly Val Pro Sex Phe Met Gin Pri> 
15 ID 15 

ABp Aep Lfju Glu Pro Gly Svr <3Lu Gly Pro Asp L^^u Thr Asp lie Val 

20 

lie Tlir eye S^t hsp *lhr Thr Vial CVfi* Val 

40 



<210> 87
<400> 87

Leu ALa Pro fThjfc- Thr Thr uly Ser Ala Bis Ala Leu Thr AIg Val 

^5 ID IS 

cya Asn 1]<- Ser Ser Val Ale Val Ala Ve^^ Tn> Glv^ He Gly 

20 25 30 

Thr Pmci Oly Gly Pro Glin Ji^sp pjrc* Gin 

40 



<211> 40
<212> PRT 



<400> 88

Pi^D Qly l^m: Thr Ala Hie V«il Viil 
1 5 

Eie OyB Phe Sejr llo M©t Val rhr 
20 

Leu Ala Cys Kie Val MeL Hl:> 

35 dO 



Ser Ala <Jly aiy Arg Ala Gly >5Mt 
ID 15 

Pi'ij* A9c> Pro Ser T-hx- Pxo Ser Arc 
25 30 



<210> 89
<211> 26
<212> PRT

<213> Homo sapiens



<220> 

<221> SITE
<222> (1)

<223> Xaa equals any of the naturally occurring L-amino acids
<400> 89

Xziei Ser Ar»g Leu Aap Glu a)yr Trp Asp Ay^ <3i.n Arej Am Arg Hi a 

^ S 10 :i5 

Arg MQb Vzil LyB Val A«p I>xo Glu fChir Arg 
20 25 



<210> 90

<211> 68
<212> PRT
<213> Homo sapiens

<400> 90






WO 99/46289 PCT/US99/05721




I>eu tlet Sex l^eu Leu Thr- Per Tid 
1 5 

Ala Sejf Ala S^t; Pro Ser Ala Vsl 
20 

Gl.Ti Oln Lye ^51u Pro Leu Si^t- Kie 
25 40 

Lys iTo Lys lie Il±t: Cys Dhe Arg 

Ala PiMii f3.U3 Sf^t- CVb lieu Ser C;iy 

Cys lie T!hr Abw Pbe Vwl Cys Glu 
35 



Hie gIm Pro Pro Pro Pro Pji> Pit> 
10 15 

PiD Abu Gly Prfj C31ti Eer Pro lvb 
25 

Ara Phfe Ann Glu s?he Met Thi^ Ser 

Sej:^ T^i Lys Arg Gly Vol ger Ser 
60 

Val Leu Trp Unt His Val Trp Pbe 
75 BO 



<210> 91
<211> 53
<212> PRT

<213> Homo sapiens
<400> 91

P*te Gin Asn Ala Lye Olvi Olu Ala Ser Va3 Leu Pro Tyi^ Val m.y Thr 
1 5 10 15

Val Phe Leu Pho Gly Cfly Gly He Ph^ Ala Jtet Ala Lw Cfys Leu 11© 
20 25 30 

Scr Asp Ala Leu Ser Scjsr Tyr Ars^ Asp Cej- Bia Thr Aan Arg Val Leii 
35 45 

5*hr S^t: Pro Pro l>he 
50 



<210> 92
<211> 45
<212> PRT

<213> Homo sapiens
<400> 92 

Arg Le\j l&v.t Pro Phe Pro Vr<y Sor 
1 5 

Ala Oly Arci Glu A^sp Veil Val Gly 
20 

Hie LOU Leu Olu He Val Tbr Ket 

3S do 



fJer PDfo -ftig Leu Leu Val O^hr Leu 
10 13 

T5is Ser Cys Af;rt Thr Leu Ser Ala 
25 30 



Leu ll<^ Thr Trp Phe 
45 



<210> 93
<211> 51

<212> PRT

<213> Homo sapiens

<220> 

<221> SITE

<222> (3)

<223> Xaa equals any of the naturally occurring L-amino acids

Gly Gly Xaa Aep Aui:> A.?p Glu Gly Pro l^r- -Thr Pro Phef Acp a>»n- 
15 10 irj 

eer GSly Lys Lew 01 u Thar VisL T.ys Trp Ala Phe Th-r T:rp Pxo Ser 
20 25 SO 

PliB Val Deu lyr Phe Ttji- v«l Fro Aan Cyti A&n ijys Pro JLr^ T*p Ql.ix 

40 45 

LyB rap Ph^s 
50 



<210> 94

<211> 115

<212> PRT

<213> Homo sapiens
<220>

<221> SITE

<222> (99)

<223> Xaa equals any of the naturally occurring L-amino acids

Gly Gly Pro Arg Neh Lys Ar^j Sex Gly Acw Pro Gly Ala Giu Val Tlir 
15 10 15 

ABit set a€tr Val Al3 Gly Pro Asp CVB CyB Gly Oly Leu Gly Aan lie 
20 25 3D 

ft»p Phe Arg Gin Aln Asp Phe Cys Val Mefc Thr Aro JJeu Leu Gly Tya- 
35 ^0 ^5 

val Aap Pro Lot Asp Pro Sen- Ph53 Val Ala Ala Vis.l He Thr He Thr 
50 55 60 

Phg Aen Pro Leu Tyir Trv Abu Val Val Al<? Axa l^p Glu Mi*^ ly:? Tbr 

" 70 7s ao 

Arg liys; Le« Ser Arg Al« Fhc Qly Ser Pro Tyx- T.eii Ala Cye Tyr Ser 
86 5D 95 

l^E^ ser Xa£E Thir He Leu Leu Leu Asn L*e Leu Arg S^j- His Cye Phe 
100 105 110 
Thr Gln Ala
115 



<211> 51

<213> Homo sapiens

C^ly G3y Pro Arc >let LyB Ar^ St\r GLy Am Pro (31 y Ala Glu Val Thr 

1 i> :io 15 

Aan S&r Ser val Al» <31y Pro Asp Cyf: Cy& Gly Gly Leu Gly A^n 11^ 
20 25 30 

Asp Plift Ajg Oln Ala Asp Phe cys V«l Met T*hr Artj T.t^u Leu Gly lyr 
35 40 05 



V*l JVjsp Pxd 
SO 



<210> 96
<211> 64
<212> PRT

<213> Homo sapiens

<220> 

<222>

<223> Xaa equals any of the naturally occurring L-amino acids
<400> 96

Leu Asp Px'rt Si^r Phe Val Ala Al« Vol lie T*hr 11* Thr Ptwss Asn Pro 
IS 10 15 

I>»u TVr Trp ABti Val VaL KL^ Arg Trp Txli* His Lys 'Ihr Arg Lya Leu 
20 25 30 

Ser ADf>g Alp Phe Gly Ber Pryu Tyr Leu Ala eye Tyr Setr Leu Ser xaa 
35 40 <J5 

Thr II© Leu Leu Lt?u Asn l>he Leu Atg Ser Hie eye Ptie Thr G'ln Al« 
SQ 5.^ 60 



<210> 97

<211> 253

<212> PRT

<213> Homo sapiens




<400> 97



Prc> <31.n aang 8ot Giu Leu Ala Ale AJ.*? S^r Asn Args Pri* Cyw At^ Vol 
is 10 15 

Cys 11© Sex Leu Leu Leu cyi* Jj^vt Glu Asp ArQ rt^r M©t Pro Lye Lye 
2n 2li 30 

Ala Lye Pro Thr <3ly Sex Giy Lvs Glu Glu Gly Pxo Ala Pro cys Lys; 
35 dO 

nin Mifft iv* Lou Glw Alia Ala Gly Rly Pro Ser Ala L^i:^ A^xj phe As© 

55 60 

^er Pro Ber Ser Leu Phe qlu Sex Leu lie fitu- Piro lie Lye Tlxr Glu 
^5 70 75 80 

Thr Phft PtiK Tjys t^ivi Phe Trp Glu Gin Lys Pro Leu Lftii Tl^ Gin Jtrir 
as 90 95 

Abp Abop l>ro Ala Leu Ala rPhr Tyx 'lyr Gly s©r IjQU Phe Lye Leu rhj- 
100 105 ILO 

Asp l^u TJyt-i 5cr L©u Cys Ser Arg Gly M^t Tyr- Gly Airg Asp Val 

3-15 120 125 

Asn Val Cvb Axg eye Val Ar^n Gly Lys Lye Ly& Val Leu Asn Lys Aep 
, 135 lio 

Gly Lyes Al*i Hi& Phe Leu Gin Leu Arv Lys ABp Phe Aep Glxi Lys* Arg 
145 15D 155 160 

Alo Thx lie <Sln Phe Hie fjln Pro Gin Arg Phv I.y& Asp Glu Leu Trp 
165 170 17.^ 

Arg He Gin Glu Lye L&u Glu Cye lyr Phe <SLy Se^ Leu Viil Gly jg^r 
180 1S5 190 

Asm Val Tyr He Thr Pro Ala Aep Leu Arg A].a Cye Arg Pro lie Met 
1*5 200 205 

Met Met. s?i^ir Arg I^he Ser Ser Cy:; s^r CTrp Arg Glu Arg Asn Thr Gly 
210 215 320 

Ala Ser "Jhr OOur Prrj* Leu Cye Pro Trp hHe^ Glu Ser 'fhr Ala Ti^i^ Ai^g 

230 225 240 

Pro Axg Lys Gly 5er Ala Gly Arg Cye Met S&r Cys 



<210> 98
<212> PRT



<213> Homo sapiens
<400> 98

Pxo Gin Ara i3er Glu Ij^i? Ala Al€i Ala Se^ Ar.n Arg Pro Oys Axa Val 
15 10 1^ 

Cys Ilfi l^jii Leu Cys Leu Gl\i Ji/^p Axa l^r Paet Pro Ly« Lyf; 

20 25 30 

AlA Lya Pto Thr Gly Ser Gly Tiys; 01 u Glu Gly Pm 
35 <I0 



<210> 99
<211> 45
<212> PRT

<213> Homo sapiens
<400> 99

Als Pro Cyii J^yjs Gin Met Lya Leu fiJx? Ala Ala Gly Gly Pxr;^ S^r Ala 
IS 10 15 

Leu Asn Asp Ser Pro Sox Soar Leu Phfe 03 \3 Bbt Leu lie Ser Pro 
20 25 30 

lie Lya Thr QHu Thr Pbe Phe Lye filu Phe 'I'rp Glu Gla 
35 40 4& 



<210> 100

<211> 44

<212> PRT

<213> Homo sapiens

<400> 100

L-yB pro Lfeii Tou 11 B C5ln Arg Aes£i Aa;p Pro Ala Leu Ala TliiT* Tyj- Tyx 
15 10 15 

Oly Ser Leu Phe Ly^r T,^ Thr Asp Leu Lyf: Sex Leu Cys aer Arg Gly 
20 25 30 

Met Tyx Tyr Gly Arg Asp Val A&n Val Cys Arg Cy.y 
35 



<210> 101

<211> 45

<212> PRT

<213> Homo sapiens

<400> 101

Val Acn Gly Lys Lya Lys VaD T.-©u Aan Lye Acp GHy Lys Ala Hie Fhe 
15 15 



Lwi Gin Leu Arg Lys Asp Phe Asp ^l.T^ Tyf> Ars Ala OThr ll<j <31n Ph^t 
20 25 

H-L.s G3o Pro Gin Arcr Phe Lye A.sp Gl« Leu Ti^p At^ He 
3S 40 d5 

<210> 102
<211> 44
<212> PRT

<213> Homo sapiens
<400> 102 

Gin filu LyK I^ij Qlu C!ye Tyx Phe Gly Ser Leu Val G.ly Sex Asn Val 
IB 10 15 

Tyr He Tlir Pxo Ala Aep Leu Aro Ala Cye Ax-g Pro lie Met Met Met 

2S 30 

Sear Arg Plte Ser Sftj; C?y3 Ser Trp Arg <:ilu Arg Aen 
35 40 

<210> 103
<211> 31 
<212> PRT 

<400> 103

Tlir Gly Ala ser Tlisr T?hT- Pro Leu Cye Pici Trp His Glu Ser Thj- Al£i 
^5 10 15 

Trp Aj^ Pttji Arc I^ys Gly Se^ A) a Gly Axg Cye Mot Ser Leu CVa 
20 25 30

<210> 104
<211> 53
<212> PRT

<213> Homo sapiens

<220>

<221> SITE

<222> (53)

<223> Xaa equals any of the naturally occurring L-amino acids

<400> 104

Gly Gly aiy lie UiA Arcj Leu Hie At^ aiy Ale Leu Gli. Apsj Val 

^5 10 15 

I^u Gill Ars Val Glu Hifj t>e?u Hie Leu Leu His His Ala Val Lyft Hi..; 
2^ 25 30 

lie cys Thr Al^ Ser Leu Pic^ Val Leu His Gl> Phe He Ala Ala Gin 
35 40 

Cys Arg Pro Gly Xaa 
SD 
<210> 105
<211> 151

<213> Homo sapiens
<400> 105

Tn> Asp Ar^r *i^'P Ser Acp Sftr AIr Len Arg Aicg Arg 01 v Ser Gly 
15 10 

Asc> Leu Ala Gly Glv t.to Glu Glu Leu GJ.v Clu Gtlu Arg Ala A.lf> CVn^ 
20 25 30 

din <31v Cys Ara Ala Arg Ar^ Ptto Tjtp Glu I.eu Phr^ Cln his Krg Ala 
35 40 45 

Lew Arc Arg Gin Val Tlu- Scr Lea Val val Lcjv Oly iJer Ala Hi±t. QIm 

50 55 eo 

Leoi eye Gly Ap^n Asp Ser Val Ty^ Ala TVr Ala ser Sea- VaI Phe Am 
65 RO 

Ly& Ala Gly Val Pro Gin Aln Lafs lie Gin Tyi- Al« He lie Gly Thx- 
55 90 96 

Gly ser Cys Glut Leu Leu O^ir Ala Vl^\ Val iter Val Sftx i.^iu Olu Gly 
100 lOS 110 

Ala Lau Pro Paro Pro Al^ Leu ^'rp Gly Gly Tlir Pro Aro Ser Sev Ale 
11.5 120 12h 

Leu Afiifl 01 n Pli"© Thr Leu Glxi l^u Tys Lyfl Lye Lyc Lyis T..ya Lye Lys 
130 135 140 

Lya Lys Lys Lye Lyc Lyj; Lys. 
145 150 



<210> 106 
<211> 37 
<212> PRT 

<213> Homo sapiens
<400> 106

Ar^ Ar^T Leu Arg Gly Sts*v Gly Asp Leu ad?j Gly Gly Leu Glu QIvl Loy 
15 10 15 

Glu Glv Clu Ar55 Ala Ala C^ks Gin Gly eye Axy Ale Arg Arg Pro <Prp 
20 S>!i 30 



Glu Leu Phe G1j> iiis 
35 







<210> 107
<211> 29
<212> PRT

<213> Homo sapiens




Gly J^'sn £?©r val Tyr Al« Tyr Ala Ser Ser Val Phe 
20 ^5 



<210> 108
<211> 34
<212> PRT 

Thr Gly Sei^ Cy:; Glu Leu Leu Thr 
1 5 

<31y Ala Leu Pro Pro Pr» Ala Leu 
Ala Leu 



Ala Val Via I 8ex Val Ser Leu Glu 
10 L5 

T-Jf^P aiy Gly iThr Pro Arg S^r Sex 
25 3D 



<210> 109
<212> PRT

<213> Homo sapiens
<400> 109

His Glu lAu Aang Leu Arg Lye A^ Thr Val Ly^: Pht^ s«x Leu ryr Arg 

Mi» Ph<5 Lys Asn Thr L^^u lie Phe Al^ Val Leu Ala S3er xi<? Val Phe 
20 25 3D 

KBt Gly Tifi> Thr Thr Lya Thr Phe Arp lie Ai« Lys Cya Gin Se:r A-sp 

40 45 



<210> 110

<211> 24

<213> Homo sapiens
<400> 110

Trp He Pro Aang Ala Ain Gly zlv Ara Hie Glu Glu Sor He Ala Glxi 
10 15



Arc Ser Tyr Dhe Arg Thr Leu Tiufu 
20 



<211> 104
<212> PRT

<400> 111

Ala KB-p Thjf Aan Phrjs Thr Oln Glu Phr Aln Met. Tl^r MifsL lie? Thr I'ro 
15 10 IS 

Ser Sei- Lya Leu Thr Leu oOiiir Jqts Olv ab]^ Lysi fier Txp Ser Eisr- Thr 
20 2^ 30 

Ala val Ala Ala A 1a Ijcku Glu Leu VaL Asp I'ro Pro Giy Cy© Arra Abd 
35 50 45 

5ftir Alvi Arg Gly lie Aan Cyr. S^t Ala Phe Leu Leu Pro Tyr Ser S^r* 
SO 5!> 60 

his Val Trp Val Pry Lou Sor Gly Val VatL Pro Leu cye Gin ftrvj Asn 

65 70 7B eo 

Girt Olv His Thr Val Trp Val CH.ii lie lie Tyr Ser Arg Ser sex Phe 
R5 90 95 

'1^ ABp val Phe lie Ser Thr Arg 
100 



<210> 112

<211> 26

<212> PRT

<213> Homo sapiens

<400> 112

Met Otor Met Il<t Thr Pro Ser Ser Lya Leu Thr Leu Thr Lys G'lv Abu 
15 10 15 

Lys /?^3r Trp Ser Ser Thi- Aln Val Als Ala 
20 '^5 



<210> 113 

<211> 20

<212> PRT

<213> Homo sapiens



<400> 113

Aif9 Gly lie Aan Cyti S!t^-r Al« Phe Leu L&u Pro Tyr Ser Rr^x His VaL 
15 10 IS 






'rjTP Val Fro L^ffu 
20 



<210> 114 
<211> 24
<212> PRT

<213> Homo sapiens



<400> 114

VAl V«3 Pto Leu CyB Gin Arg A;*rj Gin Gly Hie Thr VaJ Ti-p Val Gin 
is 10 15 



ll& lie fPyi ser Jij^^9 s^t 5er Phe 
20 



<210> 115 

<211> 26

<212> PRT

<213> Homo sapiens

<400> 115

A&n E^h© Asp lie tjya Viai Asn Ala Gin Arg AIa Oly Tyr Lys Ala 

is 10 

Ala lie Val hIr^ Asn VaL Asp Sej^ Aisp Asp 
20 25 



<210> 116 
<211> 28 
<212> PRT

<213> Homo sapiens

<400> 116

val L^u Lys Lys lie Asp lie Pro Ser Val Phe lla Gl>/ 0\\.v^ Sex Ser 
3-5 10 15 

Alo Aan Ser Leu Lyt^ A&p Glu Phe OThir Tyi^ <3Iu Lys 
20 2& 



<210> 117

<211> 30

<212> PRT

<213> Homo sapiens



<400> 117

Pro Glu Phe Ser Leu l^ro Ijeu Glu 
1 5 

rife val Oly He cya Leu IL^ Leu 
20 



Tyr Tyr Leu lie Pro Phft l.^;^ lie 
10 15 

lie Val lie Phe Met lie 
25 3Q 
<210> 118
<211> 34

<400> 118

'irhr Lys PJi« Val GLi^k A!5p Arg Kie Arg Ala Ai-g Ars Abu Arsy Leu Arg 
1 10 15 

LVB Aap <53ti Leu Lyt l.ys Leu Pro Val llisi Phe Lvs Lye Gly A&p 



Glu TVr 



<211> 27
<212> PRT

<213> Homo sapiens
<400> 119

Glu Ar^p Gly Afeji Lya Leu Arc? 11 o Leai Pro Cys Ser Hi.ij Alo OVr Hie 
IS 10 1& 

cyft T^s CVS v±*:i. Aap Pjto Tip T-^\ Thx c^ys Thr 



<210> 120 
<211> 24
<212> PRT

<400> 120

Vol Val Pro £?er Gin Gly Asp S€ix Asap e^jjr Asp Thx Ase> Ser S«r Gin 
1 5 10 15 

<31u Glu Ajsn <Slu V»l Thx Glu Kia 
20 



<210> 121 
<211> 29 
<212> PRT 

<213> Homo sapiens
<400> 121

Glci 5er Plio Cly Ai« 3.©xi Sex Cilu Ser Ary Sftr- His Gin Aeti West "Hix 
15 10 15 



20 



20 



25 



Glu Sex St:r Asp 01 u Olu Asp Acp Asn Glu Asp 'Jlir 

20 25 
<210> 122
<211> 259
<212> PRT 

lie Arg Arg Lcni Asp cys Aan Fhe ABp lit Lye V*il Leu Asn Ala C51n 
^ 5 III 

Arsj Ala Gly Tyi- bys Ala AL« IL^ vnl His Asa V/sl Asp Ser Aep A5;p 

25 30 

Leoi lie Ser K^t. Gly Sejf Asm Asp lie Glu Val Lnu Lys Lys iLe Asp 
35 40 45 

III!? Pro aer V£il Phe He Gly Glv fi©r Ser Ala Atxi S^r Leu Lya A>p 

5<> 5s eo 

Giia Phe rhn Tyr Glu Lye Gly ^3ly hiLa Leu iLe Lcju Val Pro Gin F^f3 

70 75 eo 

Ser Leu Pro u^u Glu Tyjr Tyz: Leu He Pro Phe Leu He lie Val Oly 
fiJi 90 55 

Ele eye Leru He Leu lift Val tie Phe Wet lies Thr Lys Phe Viil Oln 
100 1D5 lip 

Abp Arg Hi-i Ara Ala Krg Ajrg Asn Arg L«u Atxj Lys Asp Gin Lki? Lys 
115 120 12!i 

h-YB Leu Prvii Val Hi a Ly:; Phe Lys Lys Gly Artp Clu 'Vyr Aap Val Cy© 
130 13& IJO 

Ala He Cyw Leu Asp Glu Tyr C51u Aep Gly Aj^^ti Lys Leu itrg Leu 

150 155 160 

PxD Cys 5<-r HiB Ai» iVr UiB Cye Lys: Val Aep Pro Trp Leu Thr 

1^5 170 175 

Lya Thr Lys Lys Tlir Cya Pro Val Cy?f Lys Gin Lye Vei Vwl I^ro Ser 
180 185 150 

61n Gly Asp ser Asp Ser Aep Th^ Asp Ser Ser Gin 01 u Clu Asn Glu 

200 205 

Val -rtir Glu Hie Thr ^jto Leu Leu Arg Pro Leu Ala B^r Val Sex AIa 

215 220 

Gin SejT Phe Giy Ala Leu Ser Oly Ser Arg Ser Hiw GLn Asn Met Thr 

230 235 240 

Glu s«r Ser Asip ivr Glu Glu Asp Aep Asn Glu AAp Thr Asp Ser Ser 
2^1 5S0 
Asp Ala Glu



<210> 123

<211> 36

<212> PRT

<213> Homo sapiens

<400> 123

Ala Gin Cy& S«t il^ Tyx Leu He Girj val 13 12 "Ph^ Oly Ala val A^p 
15 10 13. 

PXD Alii Lys; Lew Val Gly Phft lusu Vrtl 1 3 15 Ajsn Ser Leu Gly Arg 
20 25 30 

Ar^j Pro Al£i Oln 
35 



<210> 124
<211> 188
<212> PRT

<213> Homo sapiens
<400> 124

Gly Thr Val Gin His pro Asn Pro Gly Aep Leu Cys Cys <51y 

15 10 IS 

Pdtd Ala Cys Gin Ala C-Vfii Gly Leu Pro Cys Hie Gin Leu Pin Oly Ser 
20 2^ 30 

Pro Ala Cye Pro Anp Gly Cys l^i" Ala Gly Arg His: liw HiB Dro 

40 4S 

Ala 01 n 'rrp Gly A?;p Thr Pro Gly Pro Val Hie Cy$ Pro Asn Leu Ser 
50 5b 60 

Qys Cys Ala Gly Glu ialy Leu Ser C5]y Clys! fjcu Leu tiln Leu Hie Leu 
65 70 75 60 

Pro Val ryr Tscp <31u Leu Tyir Prtj Thr »et lie Arsj Gin Thr Gly Mc,-t 
&S 90 95 

Gly Met Gly Scr Thr Met Ala Ais Vttl Gly Ser lie Val Ser Prti htdyx 
100 L0!> IID 

Vzsl Sea: MeL Ttar Ala Giu Leu Tyx- Pro R^r Met Pro Leu Pbe lie Ty?- 
115 120 12b 

Gly Ala Val Pro Val Ala Ala ser Alck Vol Thr Val Leu Leu Pxo 01 u 
130 135 140 



Thr Leu Gly Qlrt Pro Leu Pro Aap Thr Vul Oln Asp Leu Glu fior Arc 
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150 155 160 

Ly!^ Gly Lys Gl/i rhr Arg G1j«i G1j» nin oiv Hijs Oln Uys Tyt W^t- Val 
1G5 170 17D 

Pj^ Lo^ Oln Ales ficr Ala Gin Glu IryA^ Ar^n Oly l^e^ 

<210> 125
<212> PRT

<400> 125

Leu Pro A£ij> Pro Uly Afip L©u Tirp Cys Cys Gly Pxu Cye Gin A2*i 

Cys Glv L^iJ Pro Cys Jiiu <3in 
20 

<211> 26
<212> PRT

<213> Homo sapiens
<400> 126

Gly cy* Thr Ala AJn Gly Arg Hia Leu Hi* Ptt^ Ala Gin Trp GLy Abj? 
^ S 10 16 

T*hr Pj^ Gly Pro v?il Kis Cya Pro Ak^n Lcoi 
20 2!> 



<210> 127 
<211> 22 
<212> PRT

<213> Homo sapiens
<400> 127

ijcii His L<fu Pro val TyT- Trp Glu 

Thr Gly Met Gly CleL fl3y 
20 



lieu Tyr Pjo Tlir Het lie Arg Gin 
10 15 



<210> 128
<211> 23
<212> PRT

<213> Homo sapiens
<400> 128

Leu V^il Ser Meli Thr Ala Glu Leu Tyr Pro iJer aet Pri> Lcfu Phe lie 
Tyx G.l.y p.^<s^ Val Pro Val Ma
2D 



in 1 ?k 



<210> 129

<212> PRT

<213> Homo sapiens

<400> 129

Fro A.Bp Thr VmH Gin Aap Leu GJu 
1 5 

Uln Gin Gin GLv HIb <Sln Lye: Tyir 
20 



£>T7V Ara JjVB Gly liy!i GJn Thr Arq 
10 15 

M^t Vol Pro 
25 



<210> 130 
<211> 720
<212> PRT

<400> 130

eye Ijeti Ala fl.la M«h Tie Olu Gly Glu Xle t33ui Ser leu His Sftr- 

5 in 15 

Glu A!;tj S«- GLy Lv« Scr Glv Gin Glu nl.y Trj> Phe "l^r Glu Leu Pru 

25 3p 

Pj!M:> Val Leu Thjf^ Phe tilu L«U S^T Jlx-g Pb^ Clu Fhe Asn GIa AIm Lbu 
2b 40 45 

Cily Ara Pro Gly Lys Lie hIb Aen r-yrri IFC!^i Olu Phe Pro Gin V«l Leu 
BO 55 <vO 

Tyr Leu ABp Tyx Met f3ls A^y Akt^ Aarxj Glu He Tbr Aig Tl^ LryB 

&5 70 75 ecj 

Ar& Cflu Glu II^ Lye Arg Leu LyA At;p Tyx Leu Thr Val Ij^u CLn Gin 
SS 1^0 9S 

Ar«g Lot. Glu Arg Tyr IjBU Ser Ty^ <3)y Sbl- Gly Pro Lyfc Arg phe Pro 
100 lOS 110 

L<i!u Vol ABp Veil Leu GLn Tyi- Al« Leu GJlu Phe Al^ Ser Ser Liys PJfft 
lis 12D 125 

V«L eye T>ir Ser Pro VisL A&p A&p lie Asp Ala Svr Ser Pro Pro 
1.^0 135 140 

Gly Ser I 1b Pro Ser Oln Ihr Leu Pro Sfti^ Thr Thr Glu Gin GJ.ji Cly 
Ids 150 15& 160 



Ala Leu Str Aiyr aiu Leu fi^r ^hr l»ro Ser ZSer v^sl A.lft Ala 

no 175 

lie Sejc S<!ir Arg Sex Vesl IJili His Lys Pro Phe "JOir Gin firtr- Arg He 
ISO 185 ISO 

Pro PtM> A^p Leu Pro Iflisl. Hi& Pro Ala Fro Ars His ll^.- Thi- Clu Glu 
155 200 2i>b 

GlQ Lc2M Ser Val Leu GJu 5ot Cy& Leu His Arg Ti^ A/g Thr Glu He 
210 215 220 

Glu Asm A&p 'Jlir Arcf Asp T^M Gin <51u Ser lie Ser Aig Tl*? Rie Arg 
2:25 ;?30 235 IJdD 

T-hjr lie: Qlu Leii Met Tyr Scir Asp Lvs Ser Met He nin V«L Pro Tyr 
^J^^ 250 2&i 

Arg Leu Hi-j Al^t Val Lftu Vul His Glui aiy tsli^ Alb Aj-.ti A.ln Gly Hie 
260 270 

•Tyr Trp A La Tyx He Pho A&p His Arg Glu Ser Ajxj Trp >SBt Lys Tyr 
275 280 285 

ABO AAp He Ala Val Thr Lysi Ser Ser Trp GLu Glu Val ArQ Asp 

290 295 300 

Ser Plip Cly Gly Tyi: Arcj Asn Ala Ser Ale Tyj;^ Cys Lean Wet iVr He 
^05 310 315 320 

AEn Aj;p Lys Ala fi phe Leu He Glii Glu Glu Phi? Asn Lys Glu fJhr 
325 330 335 

Gly <3ln Pro L^u Vnl Gly He Glu Thr Leu Pj:^i> Pro Asp Leu Are Asp 
340 345 3S0 

Phe V^i, Glu Glu Aijp Ara Gin Arg Phe Glu Lyi-i 01 u Leu Glu Glu Txjy 
3S5 360 365 

Afifp AlQ Gin Leu Ala Gin Lye AIa Lism ain Ol.u Lys Leu Leu Ala £3cr 
370 375 360 

Gin Lys Leu Ai^g Glu Sex Glu Tt«r Sor Vnl Thr Thr Ala Gin Al£i Alci 

3?D 3S5 400 

Gly Afip Pro Glu 'JVr Leu Glu 03x5 Pro Ser Atq Ser Asp Phes Lys 
dOS 410 415 

His Leu Lye <3Lu Glu Thr 13 1? Cin lie He !rtir Lye Ala His Glu 

420 d25 430 

Ki» Glu A&p Lys Ser Pxr> Qixt Thr Val Leu Gin Ser Ai« Jii? Lys Leu 
^35 440 445 

Glo Tyr Ala Arg Leu VaI Ly« Lew hlet Gin Glu Asp Thr pji> Pro GLu 
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7r 



450 



T-hr Afcp Tyr Ara Leu Kjj5 Hie VaL Val Val Tyr Phe lit; RIti Asn Gin 
465 47{? ^''^ ^f^U 

Ala Pj^i T.ys Lys life J.Jc! Clu Lys 'llir L.eu L.eu Glu C31ii Ph^j Gly Asp 

d90 d^B 

Arg AisTi Ser Phe A-sp Olu Arc Cys His AEn Il^f Mtst l.ys Val Ala 

BOO B0!> 510 

Gin Aiis }.y» LfBU GLu M«l: T.I^t bys I'XC- GLu GllJ Val Jltin F.rsu tlTi Glu 
515 S2a 525 

Tyr Glu e.lu Txp His Gin As.p Tyx Arg Lvs Phe Arg Rlij? Thr IMix Met 
530 535 540 

Tyr IJi? lie Gly Xjiau fSly Asn Phe Gin Ari? Glu Stjr Tyr lie Asp 

yds 550 Sb.S bfen 

ser l^ii 1^1 The Leu lie Cys hliS: ivr Gin Asa^ Asti J.ys Clii Leu Leu 
565 570 SVh 

Sejt^ Lys <51y Leu Tyr Axg Giv Hie Asp Glu GIii T.cu lie Ber Hie Tyr 
5H0 590 

Arg Arg <51u Cy& U^u Leu Lye Leu Ann Glu Gin Ala Ala tl^ Leu Plie 

^On 605 

Glu 8c*r Glv Glu Af:p Arc Glu Val Aan Aen Gly Li^u II© lie Met Aen 
610 615 630 

Glu Plai? lie val Pica Pile Leu Pro Leu Leu Leu Vt^n Asp Olw Met Glu 
625 630 635 6iU 

Glu Ly& Asp lie LuiiA Ala Val Glu Asp Met Arg Asn Arg Trjo Cys Sex 
645 660 655 

TVr Leu Gly Gin CLu Met Glu Pri^ His Low Gin GLu Lys Leu T^hr Aftp 



Ph« Leu Pro T.y& Leu Leu Aftp Cys Sor Met Olu He Lys sex* PJiu His 
6''5 6eO 685 

Gly Pro Pn^ Lys Leu Pro Ser Tyr Ser llir Mis Glu Leu Cyf> 03xi Arg 
690 695 "700 

Pho Ala Ar«9 Tl^ Met Leu S^ixr Lou Ser Ars? O.'hr Pro Ala Asp G]y Axg 



660 



£65 



670 



705 



710 



71b 



720 



<210> 131 






<400> 131

Met lie Glij Qly Olw lie Glu Sfer T^j PJis 3^ir Glu Asn Se^r GJy Lys 
^5 ID 15 

3©r Gly Gin Glu His Trp rbe rJur 
20 



<210> 132 

<211> 25

<212> PRT 

<213> Homo sap>i&n» 

Plxe <J]Ta Leu £3er Arg Ph^ Glv» i*he Aan Gin Ala Lov Gly Arg Pro Gi^ 

Lyi; 13 Q Kis Asn Lyss Nt^u Gltt rhe Pro 

2S 



<210> 133

<211> 25

<212> PRT

<213> Homo sapiens



<400> 133

CLu lie T>hj- Jijng lie Lyis Arg Glu 

Leu rfhr Vi*]. Leu Gin Gin Axg t^ai 
20 



11^ Lye Ar^ Leu Lye: Aai> Tyx 
ID 15 

Clxj Arg 
25 



<21D> 134 

<211> 27 

<212> PRT

<213> Homo sapiens



<400> 134 

Pro Lye Arg Ph^ Pro L.eu val Aep v^jI Ur.yA Gin IVr Ala Leu Glu ?he 
^ S 10 15 

Ala Scr Ser Ly* Pro Val Cys fThr Sex Pjto Vi^L 
20 25 



<210> 135
<211> 26

<213> Homo sapiens
I let Pri? iier Gin Tcit- Len Pro ser Thjc- Thv <"-!lv OLn Gin Gly JVln L©\i
IS 10 LD 

S<si' S^r GlTi L&u Ptq !^er T*hr Sei^ Pirti Sor 

7.0 25 



<210> 136 
<212> PRT

<400> 136

a-er val Ho His Liyss fiu Phe l^x Gin Ser Arg Tig Pro Pro Asp l.inu 
1 f> 3<? 15 

Pro Met P-ro Ala Pri> Amg His 

20 



<210> 137
<211> 23
<212> PRT

<213> Homo sapiens

eye Le^i His Krg TrP Arg rhr Glu J.lc Clu Asn Asi> Tlijr Ar<j A.^p Leii 
IB 10 15 

Gin Qlu Sex lic:^ Arq lie 

20 



<210> 138
<211> 

<212> PRT 

<213> Homo sapiens

<400> 138

I.ya sex Mot lie Glxi V<»1 Pro Tyr Arg l^eu uia Ala Val i^eu Vsl Hi^ 
1 5 10 15 

Qlu <Sly Gin Ala Aan A}^ Oly His TVr rxp Ala oyr 

20 25 



<210> 139
<211> 29
<212> PRT

<213> Homo sapiens



<400> 139 

Arg Ttp Met Lyw l-yx Aan A-Sfi Il^i Ala Val rrtir Lys Set S49r Trp Glu 
1 5 10 15

Leu V£il Arg A&p Ker Phrt fi:iy aiy ivr Arg A£iji A La 

<210> 140

<211> 24

<212> PRT

<213> Homo sapiens

<400> 140

lie AEU Aftp Lys Ala Gin Phe Leu rlii filT. 01^;t Qlu Phe A^ii T.ys Glu 
^ & ID 15 

'i'hr Gly Gin Pro Leu Val Oly lie 
30 

<210> 141 
<211> 23
<212> PRT

<213> Homo sapiens
<400> 141

MtfL lie Gin val Pro Tyr Arg 2.ic^>;i His Ala val Lftu Val His GLu Gly 
^5 ID 15 

Aloi ABl^ A.l« Civ His Tyx- 
20 

<210> 142
<211> 2€ 
<212> PRT

<400> 142

Ae5> ABi^ Glxi Aar^j Phe (Jliji Lye Glu Lei* (Jlu Glu Trp Aep Al.^a gj.n Leu 
^5 10 IS 

Ala Gin Lyg Ala Lex4 Gin Glu Lys; Leu 

<210> 143
<211> 23
<212> PRT

<213> Homo sapiens
<400> 143

Glu Thjr Ser Val -Thar Thr Ala Glx. Ala Ala Gly Asp Pr« Olu 'JVr 
5 10 . 15 

Leu OLu Gin Pro Ser Arg a©r 
20



<211> 2^ 

<213> Homo sapiens
<400> 144

Oln lie lie Thr L-yet Ai» Se^r His Glu Bis Glu Ar-p l.yr- Ser FJfCi Glij? 
1 IIJ 3S 



'llir Val L.5?u Gin Ser AJw lie Ly& Leu Glu Tyr J\l*i 
20 25 



<210> 145

<211> 28

<212> PRT

<213> Homo sapiens

<400> 145

Leu AJ.rt Gin Glu Aap lOir tPro Pro Gl-u Thr A^p 'jyr Arg L^u Hlb Mis 
15 10 15 

val VrI Val Tyr Ph«k lie Gin Aan (31« A3?i Pro Lya 
20 25 



<210> 146 

*211> 2& 

<212> PRT

<213> Homo sapiens

<400> 146

Gly Asp Arg Asn Leu Sax- Ph« Asp <Slu Arjs eye Hiss. Asn lie Met Lye 
IS 10 15 

Val Ala Gin Ala Lya Li^v Clia Met lie Lys Pri> GLu Glu 
2U 25 



<210> 147 

<211> 26

<212> PRT

<213> Homo sapiens

<400> 147

Glu Glu Trp His Gln Asp Tyr Arg Lys Phe Arg Glu Thr Thr Met Tyr
1 5 10 15

Leu Ile Ile Gly Leu Glu Asn Phe Gln Arg
20 25



<210> 148
<211> 29
<212> PRT

<400> 148

lie Cys Ala Tyi- Gin Aon AiJi T.yg 
1 5 

Arg Gly His Aep mu Leu 
7,0 



Glu Leu Leu Sktt Lys Gly l^eu Tyr 

Ser HiB r/r Aj^g Aig 
2E 



<213> Homo sapiens

<400> 149

Cys Leu Lftij Lys l.eu Jif^n Glu Gin Ale Al« Clu Leu Phrt <3li, s^r Gly 

C51u Aap Ajhj Csiu Val Asn Aen Gly Lrtu I I y He Met 
2CI 2& 



<210> 150
<211> 31
<212> PRT

<400> 150

Vol Asp {3lu Met Glu Glu Lys Asp He Ala V«1 Olv Asp Met Atg 

^ ^ iO 15 

Asn Arg iTp Cyr. £fer Tyr Lotk Oly Gin Glu M^^l: CLu Pro Hie Leu 
20 2S 30 



<210> 151
<211> 25
<212> PRT

<213> Homo sapiens
<400> 151

Gln Glu Lys Leu Thr Asp Phe Leu Pro Lys Leu Leu Asp Cys Ser Met
1 5 10 15

Glu Ile Lys Ser Phe His Glu Pro Pro
20 25



<210> 152
<211> 21
<212> PRT
<400> 152

Gin He A1.J3 Thr Ser V»l His hlb Asn lie -A:;n Arg J^yB Lya Arg Ser 
1 .5 10 15 

Veil Lieu /ix'y Leu Li&u 
30 



<210> 153

<211> 32

<212> PRT

<213> Homo sapiens

<221> SITE
<222> (10)

<223> Xaa equals any of the naturally occurring L-amino acids

<220>

<221> SITE
<222> (22)

<223> Xaa equals any of the naturally occurring L-amino acids

<400> 153

Pro Leu Lev Arr3 Glv l^i* Phe lie Arg Xaa Airig a la Oly His Ty*' Mu 
15 10 15 

Cys val PIjc His Glu Xaj^ VrI Glu Gly Gly AHw Cys Cys Glu Gin Qys 
20 25 :^(> 



<210> 154

<211> 44

<212> PRT

<213> Homo sapiens

<400> 154

LeiA Val ABA Ajsn Ser Phe Pli^t Leoi Glu Phe He TV Arc Asp Sej- 
lb in 15 

Ly*: Asn Trp GJ.n Tyr Gin Glv Thr He liya Lye Qly Asp Leu Leu Lev 
20 2i> 20 

A&n Ars; Val Oln Lye Leu S^r Atq Val Tie A^n Met 
35 do 



<210> 155
<211> 34
<212> PRT
<400> 155

IJo Arg <SLu Leu Si^r Arg Phe 21c. AIp Ala Gly Arg Lis^^ His eye Ly-i 
is 10 

33 G Asp LyB VAJ. Asn Glu rift Veil Glxi Thr Asn Arq Tyi- S^r HiE Pin- 
J?<s^r Glu 



<210> 156

<211> 195
<212> PRT

<220>
<222> (11)

<223> Xaa equals any of the naturally occurring L-amino acids
<400> 156

^3ly Sex Gin Pro Pro Gly Pro Vol Pro CiLu xaa Leu lie Aru il& Tyr 
^ ^ ID ,5 

Ser Met Arg Phe Cye? Pa^ Tyr Sar bIh Arg Thr Jir.g val Leu 

20 2?> 2D 

Ala Lys Ar.p lie Ar^g Kie <51u VaL val Aao i:ir; Aan Leu Arg Afin Lys 
3S do 

Pro Glu Trp IVx Tyir Thr Lys Hie Pro PJita RJy bIb He Pro V«1 Leu 
50 5S SO 

Glu Thr Sur Gin Cytr OLn Leu lie TV^f Glii Ser Val He Ala Cys <31u 

70 75 GO 

Tyr Leu A^p Asp Ala Tyr Pro Gly Arg Lyr. Phe Pro Ty^^ A^p Pro 

B5 90 95 

Tyr Glu Arg Ala Ax-tj Gin Lys Het Leu L*«* Q1\k Leu Phe cya r.ys VaL 
10a L05 110 

Pr-o Hlc Leu Thr Lys Glu Cys Leu v^^ Alo Leu Arg cy-. <?ly Ara Glu 

120 125 

Cya Thr Asn Leii Lye Ala Ala Lftu Arg Gin Ola Phe Sor Asn Leu Glu 

135 i40 



Glu Ho Leu Glv TVr Gin Aean Thr Thr Phe Phe Gly GJ.y Thr cya ii^ 

160 



L5C? IbS 



Ser W^t lie JLap Tyr Leu Leu rrj) Pro l^p Phe Glu Arg 



Leu A&p Val 
165 170

Tyr Oly lie Iieo Asp Cya Vi^l Ser His T^r Pri^ /kla Cye <3ly £3ei^ Gly 
ISO Iris l>»n 

Tyi- Gin Pro 
195 



<210> 157
<211> 34
<212> PRT

<213> Homo sapiens
<400> 157

Leu Ala Sfer Pro Phe P7-c> Val Pro L<iij Arg Cys Siir AD 9 

15 10 



<210> 158
<211> 29
<212> PRT

<400> 158

WeL Ars t?^he cys Pro Tyr J3c:;t- Bie Arg Tlir Arg r.ipu Val Leu Lyfc A5a 
IS 10 15 

liyfc Asp lie Atg His Glu Vw.1 Val Aen lie Aejrt atct 
20 2b 



<210> 159 
<212> PRT

<213> Homo sapiens
<400> 159

Asn Lye Pro <31ti Trp Vyr Tyr Thr I^ys Hlb Pro Phe Oly His lie Pro 
15 ID l!i 

Val Leu Glu Thr S©x Gin Cycv RDx* 
20 



<210> 160
<211> 24
<212> PRT

<213> Homo sapiens
<400> 160 

Ijyfc Leu Ph$ Pxo Tyr Abe> Pro Tyx Glu Arg Ala Arg Gl» bys Het ijeu 
15 10 IS 



Leu IjBU Phe eye Iiy& VdlI Pro 



20



<211> 25
<212> PRT

<213> Homo sapiens
<400> 161

V«iD. Ala i.eu Argg Cy.-j Gly at^ gIu CyA T»ir Asti beu L.ye Ala Al.fli Leu 
^ ^> 10 15 

Arg Cln Glu Ph& Asn Leu Glu Glu 

^0 25 



<210> 162 
<211> 24
<212> PRT

<400> 162

Ser Met III- Aap Tyr i^ij Trp l*ro T^p Phe Glu Axu Asp Val 

1 5 10 15 

TVr Gly lli- Leu Asp Cy.s VrI Ser 
20 



<210> 163

<211> 60

<212> PRT

<213> Homo sapiens

<221> SITE
<222> (15)

<223> Xaa equals any of the naturally occurring L-amino acids
<400> 163 

Ala Altfi Oly Cya Vdl I'xp Aap Thjf Gly Clys Glu Pro ills Xati 8©r 
^ & 10 IS 

Leu Arg Leu Trp l?.e Sex Ala Met Ly$ Trp Asp t^ro Tl^r Vai Cys Ala 
'^'^ 25 30 

Leu Leu Met Aap Ly« Ser lie Phe Gin Oly Ph© Leu Aan Lru Tyr Phe 
35 qo 45 

Gin Asn Asn Pro Asn Als Phe Acp Phq? Gly Leu Cya 
50 55 60 



<210> 164
<211> 180
<212> PRT

<213> Homo sapiens

val Tyz- Leu Phe L«v Thr 'JVr Arg Gin Alia V«1 Val lie Ala Ijtii> Leu 
1 5 10 1E> 

Val liy« VrL <ily Viil. lie ifer Glu Lye nis TIit- Tirp Glu Ticp GJn llir 
■2D P.^ 30 

Val GIo Alct Val Aiii Thr Gly Leu Gin Aaf* Pho lie lie Cy& Uc Clu 
3S 4U it> 

M(Tt Phe i^BU Aid Ala lie Ala lEis^ Hn:4 Tyr Thr the Ser Tyi- I^b Pro 
SO 5.^ 60 

Tyr.- VnL OLn Glu Alci Glu Glu Gly flc^r Cys Phe Asp Sei^ Phe Ala 
e5 70 75 00 

Met Trp As3i> VaL Ser A&p rle Ar^ A^;p tep ILe sex Glu Ql-o Vol Axg 
8& DO 9E; 

Ul£^ Val tslv Arij Tlir Vial Arg Gly W.is: Pro Arg Lya LyK Jjtw. Phe Pro 
J.QO 105 110 

Glv Asp Gin Asp Gin Asn Glu Hi;* Thr S«r Leu Leu S&x- Scr Ser iter 
/ lift 120 1.25 

Gill Asp Ala Il^j Ser lie Ala 5*?r Ber Wot I'xo Pxd Ser Pro M^t Gly 
130 J3S 140 

KpE^ ivr Glrj Cly fc^he Gly iaia Thr Val Thr Pro Gin Thr Thr Pxo Thx 
1^5. 150 I3fi 160 

T^r Ala Lyis n© ser Aap Glv He Leu ser Aep Tht He Gly Olu Lyis 
165 l;o 17b 

Lye Glu Pirx> Sex 
IBO 



<210> 165

<211> 176

<212> PRT

<213> Homo sapiens

<400> 165

Thr Aen Asn Lye A&p Ser Leu Gly Trp Tyr Leu Phe Thr V&i Lp« Asp 
IS 10 IS 

Ser Trp He Alia Leu Lye Ty^ P-^*^ C?l-y '-^^^ V^l- Asp Tl^r 

20 2S 30 

eye Arg Glu Cyrj Tyr Glu Ala Tyx V©L lie Tyr Aba P3>e Met Gly Phts 
3S 40 45 
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Leu Thjr QVr l-eu Tlir Asn Arg Tyr Fro Aei^ l^u V^ai He ijeii 

50 55 60 

Glu Ala kiyifi A&p Gin GIji HiB Phfr Pro Pro Lftu c.y* Cv& Cys Pro 

65 70 75 RO 

t^ro rrp AIa Met Gly GXu V«I Leu Leu PJie Arg Cy^ £{er val 

$n ^5 

LBoi Gli^ Tyr Thr Val Vi*] Arg l>ro Phe alir nlir llw Vwi ftia Leu Jle 

IDf 105 110 

Cyfi C33v L«u Leu Gly Jlr^ Tyr Asp Glu Gly Acji P>io Sex I'he Ser Aan 

lis LlJrj 12& 

Ala Trp Thr Tyr l^u v^l lie lie Aen Aisn M<tn: S^r 01 n ueu Phe A]» 

150 115 140 

Met Tyr Cys Lieu Leu T^i Phe Tyr Lye Val L<»xi Lys Glu Cilu Leu s<;t- 

150 155 160 

Pro lie G3n Pxd vaL Gly hys yhe Leu Cys Val Xtyzi. Lesu VqI Val Phe 

1^5 no IVh 



<210> 166 

<211> 28

<212> PRT

<213> Homo sapiens

<400> 166

CJlti Asn Ser Gin Arg Thr- aiy L©u Pro He Tbr He Phis ftrg ifer 

1 5 10 15 

Elie l?ro Leu Leu 'Phr Gly 5or Abd Leu Cye Glu Acrt 
2D 2S 



<210> 167 

<211> 9

<212> PRT

<213> Homo sapiens

<400> 167

Gln Phe Phe Leu Cys Arg Asp Cys Ser
1 5



<210> 168

<211> 38

<212> PRT
<213> Homo sapiens

□ Lu Arg 01 u Ser Cys Ser ia« J.Iil-. C3h Aln Cly VaL Gin Trf* Cy.^-^ Asn 
1 .-i 10 15 

Le\j f>Mr tor Leu Krg Pris Pip F^rp ?,T<? 01 y I'he Isys Glrj PJ^ft Sr-r: His 
20 25 30 

Leu S^ir FjBU 5'ro Ser Ser 
35 



<210> 169
<211> 116
<212> PRT

<400> 169

Ueu OLu Asn Lifsu Jn« Leu Ser iJer Ara Leu Glu CVs Ser Oly Alcs 
15 10 l.S 

31© Sex Ala nliz C?y& Asp Jjeu HiB Leu Leu Gly Stzr Ser Asxi ser Pris 
20 30 

Thr Ser Als Si^j- Oln Vol Val Ar-g Thr Tlir Gly AJr His His ain Ala 

4& 

Oln L^ro He PKr^ Vol Phe Leu Val Glu Thr Gly Pin? Hi& His Val Gly 
SO 55 60 

Gin Ala Hifc Ijcth Lye Gin Leu Thar Ser Arg Tyr Pro Pro dia Leu Ala 
65 75 BO 

S©r Gin Ser AIa Gly lie Thr Gly Met Ser Tyj- At-xj Tbx <31n Pro Lye: 
SB 90 6Ji 

Lflu Leu Trp Ph^i Tyr Leu Tyr Lye Gin Phe Lyft Oln Tvr Arg Glu Val 
100 105 110 

<Jly Ser Ar^ i/yB 
115 



<210> 170 
<211> 25
<212> PRT

<213> Homo sapiens
<400> 170

Ser S^r Arg iMm 01 u Cys Ser Gly Ala lie Ser Al^ Kie Cya Asp Leu 
1 5 10 15 



Hia Ltv Leu Gly S<-r 3er Asn Ser Fxo 
2D 



<210> 171
<211> 40
<212> PRT 

<400> 171 

Gly Ala Hi& Min^ Gin Ala Glii Pdtd lie Ph^ VaI Ph<3 Leu Vnl Glu Thr 
IS m Ih 

Gly Phe >3i& Hi!5 Val Cily Gin Ala uli: Li^u iys: Gin r.w Thr Aer Arg 
20 25 30 

'IVJT Pro Pro Hir> LeojL Ala Sex Gin 
^5 4D 



<210> 172 
<211> 25
<212> PRT

<213> Homo sapiens
<400> 172

lie Thr Gly Met Eei;^ Tyr Arg Thr Gin Pro l^ya Leu Leu ^rp Pbc: Tyr 
Is 10 15 

LeLk rp^rif 'Ly^ Gin Phe liyvi <3Iti Tyr Ar^r 
20 2B 



<210> 173
<211> 2^ 
<212> PRT

<400> 173

Glu ABA Pho Pro Glu rhj^ Ar^9 Glu Val Ars Ala l^e Ber Pro Axg Glu 
^ 10 15 

Aen Leu <3lu Leu Cya Thjr <iyr, l^y^; Ser 
20 25 



<210> 174

<211> 11

<212> PRT

<213> Homo sapiens

<400> 174

Ale LOfu Vyx cy& Sot Ihro aer Leu Din lie A«p 
1 5 10 



<210> 175 
<211>
<400> 175

Cys His Cys Skt Ket Leu Ly-i S^n- Hir^ Gly Asp Val Gin Atji Val Leu 
15 10 15 

Thr Leii Phe V*jX Thr Val Leu Srsr Asp Val Ser ']Vr Leu Gin Oln He 
20 2!> JO 

Oln LyB Ly& l-ic;xi Arg 
35 



<211> 39
<212> PRT

<400> 176

eye Tyj- Etie Hie GLti J.ys Ala Glu Ser A&n GJ.y Vro Glii LyB Glji OLxi 
15 10 L5 

Lye Q^u Gly Val ll« Oln Aen Phe Lys Ang Thr Lefu £fer Lya Lya QDv 

25 SO 

LyB LyK Glu Lye Ly:; F.yB LyB 
35 
C. ADDITIONAL INDICATIONS (leave blank if not applicable)    This information is continued on an additional sheet



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States)



EUROPE 

In respect to those designations in which a European Patent is sought a sample of the deposited
microorganism will be made available until the publication of the mention of the grant of the European
patent or until the date on which the application has been refused or withdrawn or is deemed to be withdrawn,
only by the issue of such a sample to an expert nominated by the person requesting the sample (Rule 28(4)
EPC).



The indications listed below will be submitted to the International Bureau later (specify the general nature of the indications, e.g., "Accession Number of Deposit")

For receiving Office use only

[X] This sheet was received with the international application

Authorized officer

Paralegal Specialist
IAPD-PCT Operations

[ ] This sheet was received by the International Bureau on:

Authorized officer
CANADA

The applicant requests that, until either a Canadian patent has been issued on the basis of an
application or the application has been refused, or is abandoned and no longer subject to
reinstatement, or is withdrawn, the Commissioner of Patents only authorizes the furnishing of
a sample of the deposited biological material referred to in the application to an independent
expert nominated by the Commissioner, the applicant must, by a written statement, inform the
International Bureau accordingly before completion of technical preparations for publication
of the international application.

NORWAY 

The applicant hereby requests that, until the application has been laid open to public inspection (by
the Norwegian Patent Office), or has been finally decided upon by the Norwegian Patent
Office without having been laid open to inspection, the furnishing of a sample shall only be
effected to an expert in the art. The request to this effect shall be filed by the applicant with
the Norwegian Patent Office not later than at the time when the application is made available
to the public under Sections 22 and 33(3) of the Norwegian Patents Act. If such a request has
been filed by the applicant, any request made by a third party for the furnishing of a sample
shall indicate the expert to be used. That expert may be any person entered on the list of
recognized experts drawn up by the Norwegian Patent Office or any person approved by the
applicant in the individual case.

AUSTRALIA 

The applicant hereby gives notice that the furnishing of a sample of a microorganism shall
only be effected prior to the grant of a patent, or prior to the lapsing, refusal or withdrawal of
the application, to a person who is a skilled addressee without an interest in the invention
(Regulation 3.25(3) of the Australian Patents Regulations).

FINLAND 

The applicant hereby requests that, until the application has been laid open to public
inspection (by the National Board of Patents and Registration), or has been finally decided
upon by the National Board of Patents and Registration without having been laid open to
public inspection, the furnishing of a sample shall only be effected to an expert in the art.

UNITED KINGDOM

The applicant hereby requests that the furnishing of a sample of a microorganism shall only
be made available to an expert. The request to this effect must be filed by the applicant with
the International Bureau before the completion of the technical preparations for the
international publication of the application.
DENMARK

The applicant hereby requests that, until the application has been laid open to public
inspection (by the Danish Patent Office), or has been finally decided upon by the Danish
Patent Office without having been laid open to public inspection, the furnishing of a sample
shall only be effected to an expert in the art. The request to this effect shall be filed by the
applicant with the Danish Patent Office not later than at the time when the application is made
available to the public under Sections 22 and 33(3) of the Danish Patents Act. If such a
request has been filed by the applicant, any request made by a third party for the furnishing of
a sample shall indicate the expert to be used. That expert may be any person entered on a list
of recognized experts drawn up by the Danish Patent Office or any person approved by the applicant in
the individual case.

SWEDEN

The applicant hereby requests that, until the application has been laid open to public
inspection (by the Swedish Patent Office), or has been finally decided upon by the Swedish
Patent Office without having been laid open to public inspection, the furnishing of a sample
shall only be effected to an expert in the art. The request to this effect shall be filed by the
applicant with the International Bureau before the expiration of 16 months from the priority
date (preferably on the Form PCT/RO/134 reproduced in annex Z of Volume I of the PCT
Applicant's Guide). If such a request has been filed by the applicant, any request made by a
third party for the furnishing of a sample shall indicate the expert to be used. That expert may
be any person entered on a list of recognized experts drawn up by the Swedish Patent Office
or any person approved by the applicant in the individual case.

NETHERLANDS 

The applicant hereby requests that until the date of a grant of a Netherlands patent or until the
date on which the application is refused or withdrawn or lapsed, the microorganism shall be
made available as provided in Rule 31F(1) of the Patent Rules only by the issue of a sample to
an expert. The request to this effect must be furnished by the applicant with the Netherlands
Industrial Property Office before the date on which the application is made available to the
public under Section 22C or Section 25 of the Patents Act of the Kingdom of the Netherlands,
whichever of the two dates occurs earlier.